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AA New Proof of the Bieberbach Conjecture 
for the Fourth Coefficient 


Z. CHARZYNSKI & M. SCHIFFER 


Introduction 


In 1955 GARABEDIAN & SCHIFFER [/] proved that the fourth coefficient in 
the Taylor series of a function f/(z), univalent in the unit circle, satisfies the 
inequality |a,| <4. The argument consisted in two basic assertions: a) It was 
shown by means of the calculus of variations and by use of LOEwWNER’s differ- 
ential equation that the problem of finding an extremum function for the 
Bieberbach problem is equivalent to a nonlinear boundary-value problem for 
a system of ordinary differential equations. It was proved also that the Koebe 
function led to a solution of this boundary value problem and that no other 
solution could lie in a very small neighborhood of this particular solution. 
b) There remained then the task of showing that outside of this critical neigh- 
borhood there could exist no’solution leading to an extremum function of the 
Bieberbach problem. This was achieved by a laborious and complicated set of 
estimates and inequalities. 

While the approach of that paper shows in principle a general method of 
attack upon coefficient problems for schlicht functions, the amount of labor 
involved in this relatively simple question seemed discouraging, and it was 
doubtful whether higher coefficients should be dealt with in a corresponding 
manner. The purpose of the present paper is to give a great simplification of 
the proof that |a,| <4 and to show at the same time that more elementary 
methods are sufficient to establish the theorem. We start from an inequality 
implicit in a paper of GRUNSKy [2] of 1939. This inequality can be derived by 
elementary means, using the Schwarz inequality and standard methods of 
contour integration. Below we give a new proof of the Grunsky inequality 
based on variational methods, both in order to make this paper self-consistent, 
and also since the new proof seems somewhat shorter than GRUNSKy’s. No 
use of LoEWNER’S differential equation is necessary. We hope that our present 
proof for the Bieberbach conjecture for a, will encourage the attack upon higher 
coefficients and stimulate new interest in this beautiful field of function theory. 


§ 1. The fundamental inequalities 
4. We shall be dealing with extremum problems within the family © of all 
functions /(z) which are regular and univalent for |z|<1 and which possess a 
Taylor series 
(4) fle) 2 +p dy 2? Gs 2? sb a eo a 
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Let w=/(z) yield the image domain D of the unit circle, and let wy, be an arbitrary 
point in the w-plane but not in D. There exist for arbitrarily small values of 
0 >0 infinitely many members of © and of the form [3] 


2 v\2 

(2 (2) = 1) +e +00), lal <4, 
where the correction term O(0*) can be estimated uniformly in each closed sub- 
domain |z|<7<1. These functions /*(z) may then serve as comparison functions 
with the original member /(z) in variational problems regarding the family ©. 

We shall be interested in maximum problems concerning the first three 
coefficients a), @,, and a, in (1). Therefore it is necessary to express the change 
of these coefficients under the variation (2). An easy calculation yields 
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Now let F=F (ag, a3, a4) be an analytic function of the three complex variables 
ay, a3, and a4. F is then a functional of f€ S with the corresponding variational 


formula 


F 
* ag? | OF i) 1 of Sa a ac si 
F F a ao (2a, + a5) + Fag 2"? 1 sana 
(4) OF OF \ 4 OF 1 
— bey ee pe Sete Olo?). 
ae Daas a Wo | Oa, we (0%) 


Having chosen a fixed functional F(a, a3, a4), we may ask for the solution 
of the extremum problem 


(5) Re {F} = max within the family ©. 


The existence of extremizing functions in © follows from the compactness of 
the family. We can characterize the extremum functions /(z) by the fact that 
under all variations (2) the inequality Re {F*} < Re {F} must hold. In other words, 


refit [a+ 24 Gl eow 


we Wo we 
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must hold for every admissible variation (2). 
The basic lemma of the calculus of variations within the class G leads now 
to the conclusion that D is a slit domain bounded by analytic arcs J” with a 
parametric representation w= w(t) satisfying the differential equation 
2 
(7) (Ge) we [4t2+S Zan 
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ee 


The Bieberbach Conjecture 189 


So far, every extremum problem (5) can be brought within the characteri- 
zation (7). It is, however, extremely difficult to draw decisive conclusions from 
(7) and to determine precisely the extremum function sought. Indeed, the 
coefficient problem of univalent functions is contained in this particular problem. 
Since it has not been possible to find a satisfactory general solution of (7), we 
may now ask for particular functionals F for which the complete answer can 
be given. It is clear from the form of (7) that the integration of the differential 
equation will be simplified if the term in brackets is a perfect square. We ask, 
therefore, for those functions F(a2, a3, a4) for which 


ole 5 Or EP | POOF, OF | oF 
(EB? =4AC, ie., (34 ye oo alee ee ee 
This condition is a simple first order partial differential equation for F. We 
can give immediately a particular integral of it which is in polynomial form 
and which contains an arbitrary constant of integration /. Namely, 


(9) F(a, 43, @4) = a4 — 22 43 + t$.a3 + 21 (as — $3) + Pag. 


Conversely, if we start with the functional (9) and ask for the solution of 
the extremum problem (5), we arrive at the differential equation for the boundary 
curve I’: 

dw\2 14 
(10) Gak 


wm 


par es << (Oe 
2 Ww 


Let w=f/(e'") be a particular parametric representation for I. Now 


dw 


Hence, if we denote 
2 7/ 2 / 2 
(12) SE oe ee 2), 


Reais 2 f(z) 


we obtain in q(z) an analytic function of z in the unit circle which is regular 
there except for a pole of order 3 at the origin and which has non-negative values 
on the boundary. Hence, by ScHwarz’ reflection principle, ¢(z) is rational in 
the entire z-plane and has the form 


(13) q(z)=— 44 2 = myst Bz Ly g2 + 23, y real. 


We can easily factorize g(z). It can vanish for |z|<1 only at the point 2 
for which 


(14) f (Zo) (2+ 34s) =— 1. 
If we put 2,=0 e’”, the point Zt =e! will likewise be a zero of ¢(z). Obviously, 


both points are double zeros of g(z). Each root of q(z) on the unit circle must 
be double, since g(z) does not change sign on this curve. Hence 


(15) 8 q(2) = A(z — e)? (2 — 0 7) (2 — Zein) 


13* 
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if a root 2) of (14) lies in the unit circle, or 
(15’) 23q(z) = A2(z — e*”)? (z — e®)2 (z — et ?)2 


if all roots of g(z) lie on |z|=1. 
In any case, in view of (13) we can assert that 


(16) Va(2) = 


and that this expression is real for |z]=1. Hence we may conclude that x=1, 
w=, and we may put (12) into the form 
2F() 
i(2)' 
We insert into the left-hand side of (17) the series development (1) of /(z) 
and compute the development of this term at the origin. We find that 


#2 Z*-- 92-1 
3 


Z? 


(17) (¢+ $a.) +1)=2 Ltt jez. 


<f fea [it (+ pai] =! let (ja— gat 2 lag) a? + 


8 2 
ir = 3 ay — 3.905 -+ Sah +1 (5a — 3 a3) gel bareey, 
Comparing coefficients in (17) and (18), we obtain 
l= ji 
(19) (ag — 2.03) + lay = 2u 


3,3 3 72) __ 2 
Ay — 2a, dg + t3a5 + 1(a, — fag) = §. 


We multiply the second equality (19) by /=g and add it to the third equality. 
We arrive finally at the result 


(20) F = a4 — 2a, a, + 43.43 + 21 (as — 203) + Pa, =%+ 2|/?. 


The right-hand side of (20) depends only on the parameter /, which we can 
choose arbitrarily at the beginning of the inquiry. The left-hand side is the value 
of the functional in the extremum case. Hence we find the general coefficient 
inequality, valid for all functions of the family S: 


(21) Re {ay — 2a5 43 + $3.43 + 21 (a3 — 2a§) + Pa} <2 + 2|I]?. 


This inequality is fundamental for our further applications. It asserts a 
relation between a quadratic expression on the left and a hermitian expression 
in / on the right. It thus reminds one of the important inequalities due to 
Grunsky [2] for coefficients of univalent functions. It can, indeed, be derived 
from the Grunsky inequalities. This fact is of great methodological interest since 
GRUNSKyY’s inequalities are derived without use of variational methods and are 
closely related to the classical area theorem. We gave the above direct proof 
of (21) in order to show the significance of the Grunsky inequalities in the theory 
of variations, where they are closely related to the problem of differential equations 
containing perfect squares [4]. 
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2. We shall later need a simple inequality for the coefficients in G which 
follows from the area theorem. Consider 


(22) ee - 2 ee 
ce A) 


Clearly, g(z) is regular and univalent for |z|>1, and an easy computation yields 


e b= — Bay, dy =—4(ay— 2a). 
We have the area theorem: 

ey 12 |B)? +3]05?+5[0,)2 + ---. 
The combination 

(25) A= az, — 2a3 


will occur frequently in our calculations. We derive for it from (23) and (24) 
the inequality 


(26) 3|A|? 5 4 — |a,|?. 


§ 2. Proof that |a4| 34 


1. We wish to show now that the inequalities (1.21) and (1.26) imply the 
Bieberbach inequality for the fourth coefficient. We may assume without loss 
of generality that 


(1) Giz 0 Reig.) = 0; 


since this case can always be achieved by replacing f(z) by the new member 
Bie 6 "fie" 2). 

We set 
(2) A=E+1n, G2=2—x+1y, OS De 


We write (1.21) with / as a real parameter at our disposal: 
(3) Ay 2+ Re {2a. A} + Py Re {a3} — 21 Re {A} + 1? (2 — Re {a,}). 


We specialiez by selecting / so as to give the minimum value to the right-hand 
side: 


2 axe} 
(4) ee 
Now (3) becomes 
5 R 7) 2 
(5) ay S = + Re{2a,]} +5 Ream sonata 


We insert here the real and imaginary parts according to the definition (2): 


(6) aS + 2{(2 x) & — yn} + 2 {(2 —x)§ -3(2 x) y=, 
Observe that 
(7) 2(2 — *)§ = 
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Hence 
(8) agS4+(2— 22x —Se+ 3 oe Gate) yey a 
By using the identity 
2 1 

(9) 2yq=a(y 4 1) CS toe 0, 
we may transform (8) into 

3 7 5 5 2 Dike 
(10) a—4s—x-F4+te—(F_Jaly PEP RY 


We use next the inequality (1.26), from which, expressed in the notation (2), 
follows the estimate 


(11) Psgx—sle+y’). 


Hence, we can eliminate 7 from (10) and obtain 


(12) a, 4s(4 1} x (3 ba) a Fi— (242. a 2x) 92 


The final proof of the inequality a,<4 is achieved by two estimates. We 
first choose 


(13) = 3 
and find that 
i ents T(t) (dat 
We can infer from this inequality that 
(15) @=4 when 0S *=54f. 
Next, we select «= 4% and obtain 
(16) iy AS Pi Ge is eee. 


The coefficient of y? is negative for all values OS xX 2; hence, everything depends 
on the sign of 


(17) r(x) =5 —3B x4 Ear, 
One verifies directly that 
(18) n(2heHie tee Al Ta) oy 


Thus 7(x) must be negative in the entire interval }?<*x<2. Hence we have 
proved that 


(19) a,=4 when } 
This completes the proof of the Bieberbach inequality 
(20) |a4| $4. 


2. For the sake of completeness we wish to indicate briefly how the funda- 
mental inequality (1.21) follows from GRUNSKY’s inequality. Let 


(21) gjazt yi 
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be univalent in the circular domain |z|>1. Define the n Faber polynomial 


of g(z) as the polynomial of degree 1 of g(z) for which we have the series devel- 
opment 


(22) G,lelaa +i Pe, 


= 
ll 
= 


It is easily seen that G,,(¢) is uniquely determined by this condition. GRUNSKY 
proved that the set of inequalities between quadratic and hermitian forms, 
fe 


n,m= 


A V2 eg 


N 
<¥ nlx, 
n=1 


are necessary and sufficient conditions that the function (21) be univalent. 


We can apply this result to the particular function (1.22), choose x,=J, 
% g=0, %3=+4, and find precisely the required inequality (1.21). 


This work was performed partially under an Office of Naval Research contract, 
Nonr-225(11), with Stanford University. 
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Global Asymptotic Stability 
for Nonlinear Systems of Differential Equations 
and Applications to Reactor Dynamites 
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Communicated by L. CESARI 


1. Introduction 


In this paper! conditions are obtained under which all solutions of certain 
real nonlinear systems of differential equations tend to zero as too. This 
study originated in some problems of reactor dynamics; however, the systems 
investigated are of a quite general nature. Therefore, the results are given in 
an abstract setting and only in the last section interpreted physically. 


Theorem 1 is concerned with the following general form of the Liénard 
equation: 


(1.1) L+h(t,x,2)e+f(x) =e) (-=dldt). 


Theorem 1. Let the following conditions be satisfied: 


(1.2) A(t, x,z), f(x), e(t) are sufficiently smooth for a local existence and uni- 
queness theorem to hold for (1.1) on OSt<w; —w<x, 2<&. 


(1.3) There exists a constant k>O such that 
RSW; 4,2) (0S 100;-—= <= %, 8 <= 0): 


(1.4) Given any constant B>O there exists a constant Kz>0 such that 
Wt, %,2)= Ke, (OS f= co; |i le | =). 


(1.5) Given any constant B>0 there exists a constant K,>0 such that 
[#(%1) — #(%2)| S Kgl — % (la |, [¥2| SB). 


(1.6) xf(x)>0  (*% +0). 


(1.7) g(x) =Sf(E)dE+>oo as |x| 00. 


0 


u Supported by Lincoln Laboratory, a center for research operated by Massa- 
chusetts Institute of Technology with the joint support of the U.S.Army, Navy, and 


Air Force, by a grant from the National Science Foundation, and by the Oak Ridge 
National Laboratory. 
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(1.8) There exists a constant K>0 such that 
leQ|SK (054 <a). 


(1.9) Sle@lat<e. 


Then given any x9, %9 the solution x(t) of (4.1) satisfying x(0)=x 9, %(0)=%po 
exists on OXt<oo and 
(1.10) limni(t),==.0 , ling (i): =0. 


t—> oo t—0o 


An immediate consequence of Theorem 1 is 


Corollary 1. If 71m addition to the hypothesis of Theorem 1 the condition e(t) +0 
as t->oo 1s also satisfied, then X(t) +0 as t>o. 

As will be seen from the proofs, the uniqueness hypothesis (in all the theorems) 
is not really necessary and is used only to simplify various statements. 

If e(¢)=0, then Theorem 1 may be thought of as a ‘‘global’’ asymptotic 
stability theorem of the trivial solution x(t)=0. That is, all solutions tend to 
zero as t->co and not merely those for which |%9|,|%»| are sufficiently small. 
The presence of the e(¢) term causes, of course, x(t) =0 no longer to be a solution 
of (1.1). This sort of complication of stability problems has been considered, for 
example, in [7, ch. 13]. 

The proof of Theorem 1 is subdivided into three parts, Lemmas 2.1, 2.2, and 
2.3. These establish respectively (under an increasing number of hypotheses) : 
the existence and boundedness (the bound depending on %, %») of x(t), ¥(¢) on 
OSt<oo; *(t)—0 as too; and finally x(t) 0 as too. Lemma 2.1, which 
is of interest in itself, is due to ANTOSIEWICcz [2] in the special case that h is 
independent of ¢. We give a similar proof of this lemma, since the method and 
some of the formulae are basic to all that follows. The method involves the 
same sort of considerations as in the Liapounov second method. However, as 
in [2], one cannot simply cite the classical Liapounov ordinary stability theorem, 
see [3, p. 113], in order to obtain Lemma 2.1, for a global rather than a local 
result is desired. The situation is further complicated here in that the total 
derivative of the energy function from which (2.6) is obviously obtained is not 
negative definite (even for the case e(t)=0, see (2.7)). Thus the classical Lia- 
pounov asymptotic stability theorem, see [3, p. 114], cannot be applied here 
to give even a local asymptotic result. Lemmas 2.2 and 2.3 are used to circumvent 
these difficulties. 

As part of the novelty of Theorem 1 lies in the fact that / is allowed to depend 
on f, it is of interest to focus some attention on hypotheses (1.3) and (1.4) which 
relate to this dependence. That (1.3) cannot be dropped entirely is obvious 
from the example ¥+x=0. It is not quite so transparent that (1.4) cannot 
be dispensed with, especially as (1.4) is automatically satisfied if / is continuous 
and does not depend on ¢. Remarks 2.1 and 2.2, which deal with a linear case 
of (1.1) and are in the spirit of the present methods, exhibit interesting examples 
in which (1.3) and (1.4) are respectively the only hypotheses of Theorem 1 that 
are violated; in both, the conclusion of Theorem 1 does not hold. In connection 
with Remark 2.2 see [4, p. 137]. 
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Theorem 2 is concerned with the system 


n 


x=— »2 Aj 4245 
j=1 
A ealal : 
z,=—Ah,(t, x, 2) 24, +6,f(*) +e, hes Pn) 
where the a, and b,; are constants and z=(%, ...,2,), Which forn=1 is essentially 
(AA) 


Theorem 2. Let the functions h;, f, e, be sufficiently smooth for a local existence 
and uniqueness theorem to hold for (1.11) on 0OSt<co;, —w©<%, 7;<%, Let the 
h, satisfy (1.3, 1.4), # (1.5, 1.6, 1.7), and the e; (1.8, 1.9). Furthermore, let the 
constants a,;, b; satisfy eather 


(1.12) a;=cb,, where c>0 and the b; are arbitrary except that at least one, say 6,, 
1s not zero, 
or 


(1.13) ' g,[bs>0 (6 = 4,-0.0). 


Then given any Xo, 2o, the solution x(t), z(t) of (1.11) satisfying x(0)= Xo, 2(0) = 2 
exists on OXt< oo, and 
(1.14) Him~7.(7) == 0; lim (f) = 0. 


t— oo t— oo 
In (1.3) —o<z<oo means —co<z;<00 (i=1,...,), and |z| in (1.4) means 
lz] = 2 lal 
The proof of Theorem 2 differs only in minor details from that of Theorem 1. 
Therefore, analogues of Lemmas 2.1, 2.2, 2.3 are not stated explicity and only 
the essential formulae involved in the proof are given. An obvious analogue of 
Corollary 1 holds here when e;(¢) +0 as foo. 


Consideration of the linear special case of (1.11) defined by 
(1.15) h(t,:4)2) =hy> 0, \ f(x) ae; (fe, 


where the /; are constants, sheds additional light on conditions (1.12, 1.13). It 
is shown, Lemma 3.1, that the real part of each of the characteristic roots of 
the coefficient matrix associated with (1.11, 1.15) is negative if either (1.12) or 
(1.13) is satisfied. Hence, in the special case of (1.15), Theorem 2 reduces to 
a well-known theorem of Liapounov, see [1, p. 314]. This classical theorem 
may be used together with a standard perturbation technique to obtain a local 
(but not global) asymptotic stability result if, for example, the second condition 
in (1.15) is generalized to f(x)=x-+0(x) as x0. However, a perturbation 
technique is hopeless if, say, f(x) =.x* (which satisfies (1.5, 1.6, 1.7)). 

We remark that the last m equations of (1.11) may be written as 


z=G(i,x,2z)z+bf(x) +e(t), 


where 
Ge a8 hy. a.5 Mp) 5 pe SOON Ay eal 6=col(b,, ...-0,) 1 €=col (ey area 


However, if G is of the form G= — h(t, x, 2) A, where h(t, x, z) is a scalar function 
satisfying (1.3, 1.4) and A is a positive definite real symmetric matrix, then 
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the system is easily transformed into a special case of (1.11). A similar comment 


applies to (1.16) below. This observation is used in the reactor applications of 
Section 5. 


Theorem 3 is concerned with the system 


pe on 
AZ 
(1.16) y = bof (x) + eo(t), 


2,=—/h,(t, x, y, 2) 2, +0, f(x) +; () (Gaia. 


where again the a; and 0; are constants and z=(z,, ..., 2). 


Theorem 3. Let the functions h,, f, e; be sufficiently smooth for a local existence 
and uniqueness theorem to hold for (1.16) on 0OXt<00; —w<x, y, 2;<00. Let the 
h, satisfy (1.3, 1.4), f (1.5, 1.6, 1.7), and the e; (1.8, 1.9). Furthermore, let the 
constants a;, b; satisfy either 


(1.17) a;=cb;, where c>0 and the b, are arbitrary except that b)+0 and at 
least one other b;, say b;, 18 not zero, 


or 


(1.18) a;/b;>0 (= 0, m.8ei)e 


Then given any %q, Vo, %q, the solution x(t), y(t), z(#) of (1.16) satisfying x(0)= Xp, 
y (0) =o, 2(0) = 2 exists on OSt<0, and 


(1.19) linn 97(7) =.0; 1 olimayd?) =.0 034 tims (t) = 0- 

t—>0o t—> oo t—> co 
Here the range of the variables in (1.3) is OSXt<00; —w<4, y, 4;<0, and the 
condition in (1.4) is O<t<oo; |x|, |y|, |z| SB. 

The first part of the proof of Theorem 3 again follows Lemmas 2.1 and 2.2 
in all essentials. However, the last part of the proof, while having some similarity 
to Lemma 2.3, is considerably more involved technically. An obvious analogue 
of Corollary 1 holds here when e;(#) +0 as too. 


Similar to (1.15) we consider the special linear case of (1.16) defined by 
(1.20) h;(t, x, Vy, 2) =e On T(x) =~, e,(t) =0, 


where the /; are constants. Lemma 4.1 shows that the real part of each of the 
characteristic roots of the coefficient matrix associated with (1.16, 1.20) is negative 
if either (1.17) or (1.18) is satisfied. The remarks following (1.15) are equally 
applicable here. 

On comparing Theorems 2 and 3 it might be suspected that more than one 
y type component could appear in (1.16) without causing any change in the 
conclusion of Theorem 3. By means of an example, Remark 4.1, it is shown 
that this is not the case. 
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2. Proof of Theorem 1 
Lemma 2.1. Let (1.2, 1.7, 1.9) as well as 


(2.4) OSht, x, 2) (OSt< 00; — 0 < %,27< 00) 
(2.2) f(x) €C(— cw, ©) 
(2.3) % f(x) 20 (— 00 < * < ov) 


be satisfied. Then given any Xo, %9, there exists a positive constant K=K(Xq, Xo) 
such that the solution x(t) of (1.1) satisfying x(0)=%9, % (0) =%p» exists on OSt<o0 
and satisfies 

(2.4) xO, 2O|SK  (0St< 0). 


Proof. Equation (1.1) is equivalent to the system 
X=—Z, 


(2.5) f= —hlt, x, —22+f(x) —e). 


The present notation is used, rather than the more conventional 4=2%, Only 
because it ties in better with (1.11, 1.16) and in turn with the applications of 
Section 5. 

Let x9, % be given (z9= — %9). Let x= x(t), z=2(t) denote the unique solution 
of (2.5) satisfying x(0)=%9, z(0)=2) in the remainder of this and the following 
two lemmas. Then there exists a 4, =¢,(%9, %) >O such that x(¢), z(¢) exists on 
O2tst. 


Define 
(2.6) E(t) =e(x()) +32) (0StS4), 
where g is defined in (1.7). Differentiating (2.6) and using (2.5) yields 
(2.7) E(t) = — h(t, x(t), — 2(#)) 22(t) —e(t) z(t). 
By (2.1) this implies 
E(t) < — e(t) z(t) <|e(é)| |z(| <|e@| + 2/e()| EW, 
which on integrating yields 


EW) SEO) +flelar+2flet|Eedr  (OStS<4). 


Applying the Gronwall inequality, see [7, p. 37], one has 


(2.8)  E’< {E(0) + fle(x) dz} exp l2 f le(r)| dt 
0 0 

Upon noting that the right-hand side of (2.8) is finite and independent of 4, 

the conclusion follows readily from (1.7, 2.6) and a routine continuation argument. 


. Lemma 2.2. Let the hypothesis of Lemma 2.A as well as (1.3, 1.4, 1.8) be satis- 
fied. Then the solution x(t) of (4.1) of Lemma 2.1 satisfies 


(2.9) Jim ¥ (1) = 0. 
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Proof. In the notation of Lemma 2.1, (2.9) is equivalent to the assertion 


(2.10) lim z(t) =0. 


t—> oo 


Integrating (2.7) and using (1.3) yields 


- t t 
(2.11) OS E(t) S E(0) —k f 2 (t) dt — f e(t) z(t) dt. 
0 0 
By (1.9, 2.4) there exists a K= K(x, 2) such that 
(2.12) E(0) +4 sup le@lt fle(|ar=K<e, 
0<t<oco 0 
and (2.11, 2.12) readily imply 
(2.13) f2(t)dt<o. 


0 


However, (1.4, 1.8, 2.2, 2.4) and the second equation of (2.5) imply the existence 
oLa K=K(x,, %)-such that 


(2.14) 37 20| SK <e (O55 2 = 60). 


By an elementary lemma, see [3, p. 273], (2.10) is an immediate consequence 
of (2.13, 2.14). 


Lemma 2.3. Let the hypothesis of Theorem 1 be satisfied. Then the solution 
x(t) of (1.4) of Lemma 2.1 satisfies 
(2215) lim~(7) =0. 


t—> oo 


Proof. From (2.4, 2.6) it follows that 


(2.16) 0< lim E(t) = E,, <0. 


t— oo 


We now show that the assumption 
(2.17) BO 
leads to a contradiction. 

There exists a sequence {#;} such that 


(2.18) Jim t,= 0, 
(2.19) lim E(t,) = Eq. 
n—> CO 


From (2.6, 2.10, 2.18, 2.19) one easily has 


The condition (1.6) implies that g(x) is strictly decreasing on —o<%S0 and 
strictly increasing on OS %<oo. Hence, by (1.7, 2.16, 2.17, 2.20) there exists 
a finite 

(2.21) % +0 
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and a subsequence of {t;!, which without loss of generality we may take to be 
{¢;+, such that 
(2.22) lim + (4) =. 


n—> CO 


Integrating the second equation of (2.5) yields 


et, +1) —7@)=—f he, x), =20) 20) dt+f(x(t,)) + 
(2.23) ct fel 
+f [f(#(a)) —t(*@,))] 4 c e(t) dr. 


From (1.4, 1.5, 2.4, 2.23) it follows that there exists a finite K=K(%, 2) such: 


that 
ional ijt ¥ eles | 


| f(x (t,))| S lz, +1) —2(,)|+K f MG) Raaa. | Oey} |e(z)| dz, 


and, hence, 
aul 


(2.24) |f(x(¢,))| S$ (2+ 4K) sup |z(4)| aie es |x (z) pid eeehl |e(t)| dz. 


ta<t<oo 


From the first equation of (2.5) we have 


(0) — x(t) = — J 22) dr, 


so that 

(2.25) lx) — x] S sup [2] (StS 4,44) 
ty<t<00 

From (2.24, 2.25) one has 

(2.26) |f(«(é,))| S2U+ & Sap\ (t)| +f e(t)| dt. 

Letting moo in (2.26) and using (4.9, 2.10, 2.18) yields 

(2°27) lim f(x (AVES 

It follows from the continuity of f and (2.22, 2.27) that 

(2.28) f(z) =0. 


However, (2.21, 2.28) contradict (1.6) and, thus, (2.17) is impossible. From this 
it follows that E,,=0, which easily yields (2.15). 

The preceding lemmas establish Theorem 1. It may be noted that Lemma 2.3 
can be proved by a direct argument using essentially the calculations of (2.23) 
to (2.28); however, the above proof is more like that of Section 4. 

Remark 2.1. As already noted in Section 41, the equation ¥+*=0 shows 
that (1.3) cannot be dropped entirely. It is not sufficient to replace (1.3) by 
h>0, as the following shows. 


Let h(t)20 and h(t)€L,(0, c). Then there exist solutions of 
(2.29) K+h(t)e+«%=0 


which do not tend to zero as t->oe. 
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Proof. The change of variables x=&+sint transforms (2.29) into 


E+ n(t)€4£=— hit) cost, 


which is equivalent to the system 


(2.30) E=—y, H=—Alt)n +tE+hA(E) cost. 
By hypothesis there exists a finite t;>0 such that 
(2.34) va h(t ) dt} exp afar )dz| <2. 


Consider the solution §(¢), 7,(¢) of (2.30) that is defined by &, (t,) = (¢,)=0. 
Define 


EQ) =3&) +3m(b. 
Following Lemma 2.1 and using (2.31) 
E(t) =— h(t) 13) + m() h(é) cost h(é) + 2h() El), 


lee) t 


E(t)< fh(t)dt +2 f h(t) E(t) dt (QSt< oc), 


, one obtains 


, l4()| S$ (44 St< oo). 


Hence, the solution %, (¢)=&,(¢)-++sin¢ of (2.29) does not tend to zero as t->oo, 
which completes the proof. 


Remark 2.2. The following shows that (1.4) cannot be dropped entirely. Let 
h()EC(0, 0), h()>0, WEL, 0), and [h() +4] h*) CL, (0, ©). 
Then there exist solutions of (2.29) which do not tend to zero as to, 


Proof. The change of variables ae t), where 


(2.32) = Cxq) |- Ii FAG atl, 
transforms (2.29) into 
E+hQ)E+E=— Gli), 
which is equivalent to the system 
(2.33) g=—4, h=—hOntét+G@. 
From (2.32) and the hypothesis there exists a constant A such that 
(2.34) 0<A< it) (0OSt<oo), 


As 
t 
(+ tlexp|— | na": 
0 


it follows from the hypothesis that there exists a finite 4,>0 such that 


{Jeo dt} exp 2 F180 at = a 


P (t) = ae 
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Consider the solution &,(¢), 7 (¢) of (2.33) that is defined by &(4)=m (4) =0. 
Exactly as in Remark 2.1 we now obtain 


(2.35) Qs (hSt<o). 


Hence, the solution x, (¢)=&,(¢)+ q(t) of (2.29) does not tend to zero as too, 
as is evident from (2.34, 2.35). 


3. Proof of Theorem 2 
First we consider the case that (1.12) is satisfied. Let x9, 2) be given. Let 
x(t), z(t) denote the unique solution of (1.11) satisfying x(0)=% 9, z(0)=%) Then 
x(t), z(t) exists for ¢ sufficiently small Define 
. vad 1 <1 32 
(3.1) E(t) =La(x) +2 40. 
As in Lemma 2.1 we now obtain 


E(t) = {B(0) +f les(e)lachexp|2 FS Jeol ar], 


which yields the existence and boundedness of x(é), z(¢) on OSt<ov, 
Following Lemma 2.2, one obtains 


(3.3) OSES yk I Lae (drt f S exle T) 2;(t) dt 


from, (4.3, 3:2), and 
(3.4) E(0) +1 sup > le(lf f » le(t)|dt=K< x 


O0St<o@ i=1 


=. 
i 


from (1.9) and the boundedness of x(¢), z(t), where K=K(x, 2). However, 
(3.3, 3.4) imply 


(3.5) ies Tt) dt <OOn 
As before there exists a KEK(uys ke such that 
(3.6) rE ae 


From (3.5, 3.6) it follows that 


\SK<0 (0St<ov). 


, 


lim z(t) =0 
t—+ oo 
which is the second half of (1.44). 
The proof of Lemma 2.3 is easily adapted to obtain the first half of (1.14). 
One simply applies the argument of Lemma 2 .3 to the first and 7+ 1** equations 


of (1.11) (here is where b;=£0 is needed). This completes the proof of Theorem 2 
under hypothesis (1.12). . 


Stability of Differential Systems 203 
Under hypothesis (1.13) the proof goes much the same way. One now defines 


3.7) EO = 82) +5 D Hal 


and obtains analogous formulae, for nie 


E(i)s {00 fell YF Je(r) ax} exp 2 s | e;(z)| in| ‘ 


f Soe Tide <0, ls to] aK <x (Ot =400). 
t=1 i=1 ~* 


el 


The adaptation of the proof of Lemma 2.3 is exactly the same as the one men- 
tioned above, except that here any one of the last equations of (1.11) may be 
used. This completes the proof of Theorem 2. 


The special case of (1.11) defined by (1.15) may be written as 


a Oa) =H... x 
a bb —h 0 .... 0 a, 
(3.8) g = 
2 dt bo — by O oa hy POD O bo ? 
Zn Sema ( re n= Z, 


where the ;>0. We denote the coefficient matrix in (3.8) by G. 


Lemma 3.1. If either (1.12) or (1.13) ¢s satisfied, then all the roots of det (AI —G) = 
lie in the left half plane Re A<0. 


Proof. It may be established by induction that 


det(AI—G) _ 1 ae Dp 
n A+ ny A+h, 3 
LL (A+) : 
As the h,>0, it is clearly sufficient to show that £(A)+0 for ReA=ZO. Let 
A=o-+it. Then 


ne Yt nba (othe) 
G9) ae & (o- hy)? +7? 


From (3.9) it is evident that Re #(A)>0 for oZ0 if either (1.12) or (1.13) is 
satisfied, which completes the proof. 


P(A) = 


4. Proof of Theorem 3 


First we consider the case that (1.17) is satisfied. Let %9, Yo, % be given. 
Let x(t), y(t), z(¢) denote the unique solution of (1.16) satisfying x(0)=%, 
y(0)=o, 2(0)=%. Then x(t), y(é), 2(¢) exists for ¢ sufficiently small. By the 
same methods employed in the proof of Theorem 2, one can now obtain the 
existence of x(t), y(t), z(t) on OSt<oo and 


(4.1) “|, |v], 
(4.2) lim 2(¢)=0, 


t—>0o 
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where K=K(Xo, Vo, 0). The analogous formulae are 


ge 
— 
R 
— 
SS 
— 
+ 


(4.3) E(t) =— 


and, for example 


Thus, it remains to show that 


(4.4) linr% (2) = 0; lim y(t) = 0. 


t— oo t—> oo 
From (4.1, 4.3) it follows that 
(4.5) 0< lim E(t) =E,, <<. 
We now show that the assumption 
(4.6) Bee 0 
leads to a contradiction. 


There exists a sequence {#;} such that 


(4.7) limi == 0 
(4.8) lime =e 
From (4.2, 4.3, 4.7, 4.8) one easily has 

: 1 (re ee 


Consider the system 
(4.10) z=—chy, y—bof(x), 
which is equivalent to the equation 


(4.11) K +c b5f(x) =0. 


It is well known that under hypotheses (1.5, 1.6, 1.7) (and c b§>0) all solutions 
of (4.11), and hence of (4.10), are periodic. Let x= q(t), y=w(f) be the solution 
of (4.10) such that w(0)=0, p(0)=)2E.. Let Q>0 be the smallest positive 
period of y(t), p(t). As (4.10) is autonomous, «= (t+), y=y(t+¢@) is a solution 
of (4.10) for all real g. Furthermore, these and only these solutions of (4.10) 
have the same orbit, @, in the x, y plane. The graph of @ is given by 


4.12 a 
a42) EASY Seo. 
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From the nonconstancy, a consequence of (4.6), and periodicity of p(t) it is 
easily seen that there exists a positive constant w such that 
3 1 is) =O) = : 
(4.13) Se | Pa) 9) ae 0 
Let e>0 be given. It is readily seen from (1.9, 4.2, 4.7, 4.9, 4.12), the geo- 
metry of the curves 
ts ree = 
. g(x) + es, const. 2 0, 
and the preceding discussion that there exists a positive integer N, and a real 
number @,=0 such that 


(4.14) |* (tm) —Plim~+e)lSe, |y (tm) —plte +e) Se, 
(4.15) |z(é)| Se (ty, St< oo), 
(4.16) yrs (tee Oem 1 
Define 
(4.17) PH=PLe)=Plt+e), PH=$(Le)=yplt+e). 
As (4.10) is autonomous, one has from (1.16, 4.10) 
4 x) — $0] =—cdoly) — GO] —eDHx (0 


which upon integrating yields 


Fee 22h) 26 ee cy f (0) —PO]de—eS4, f ase), 
(4.18) 7 gs 
(0) BO = yb) — Pll) +b. f [/eeo) HG CO] de + feo) dr. 


From (4.5, 4.1, 4.14, 4.15, 4.16, 4.18) it follows that 


Ix) — POS {1+ 2D [byl hetelbol J lye) Bl ae 


ly) -#O|S 20+ Ky|bol J |x(t) — $(x)| ar, 


where K,=K,(%o, Vo, 2%) > 0 and, in particular, is independent of ¢ and n. On 
adding the two inequalities of (4.19), there results 


t 
|x) —$|+|y® —$O| S Kee + Ke Sf {|x(%) — | +l y (x) — B®) Sat 
tye 
(ty, StSty, +2), 
where K,—=Koy(%o, Vo, 2), Which by the Gronwall inequality implies 


lx) —$()| + |v) —PO|SKeexpH,Qle = (ty, StS ty, + 2). 
14% 
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Thus, for some Kz=K3(%o, Yo, 20), 
(4.20) y(t) —PO|SKee (ty StSty. +2). 


The second and j-+2"4 equations of (1.16) together with (1.17) imply 


Hl) = P08, (t) + 9 y(t, 2, 90,210) (0) — Fell) + ell) 


(4.21) t t se 
+ 5° f iy (e, #2), ¥(2), 2(7)) z(t) dc — 92 f ej(c)de+ | éo(t) dv. 


tie tie ty, 


S 


From (1.4, 4.1, 4.15, 4.16, 4.21) one obtains 


for some K,=Ky(%9, Vo; 2). However, from 


P(t) — P(tw,) = P() — y(t) + vy) — y(t) + ¥ (te) — Bey) 
and (4.17, 4.20, 4.22) it follows that 


(4.23) max |v (t) — w(tn, + @)| S (2Kg+ Ky) e. 


tye-+ 0eStStye+0e+2 
Comparing (4.13, 4.23) yields wu (2K3+ K,)¢, which, as ¢ may be chosen arbit- 
rarily small, contradicts w> 0. 
Thus, it follows that E,,=0, which easily implies (4.4) and completes the 
proof under hypothesis (1.17). 
The proof under hypothesis (1.18) is essentially the same as under.(1.17). 
Instead of (4.3) one defines 


Ei = 3(2)) a i 1 5 a 2219, 


and modifies the calculations along the same lines as in the formulae immediately 
following (3.7). This completes the proof of Theorem 3. 


The special case of (1.16) defined by (1.20) may be written as 


x 6) ag ay Me or oa UE x 

y bp woeeO, dab aeRO y 

(4.24) S A ROP FOS FSA erie ee A 
Zo b, 0 OE nal eon eas) Zo 

Bn b, O 0) O rik 9» a h, zy 


where the 4;>0. We also denote the coefficient matrix in (4.24) by G. 
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Lemma 4.1. If either (1.17) or (1.18) 7s satisfied, then all the roots of det (AI —G) ==0 
he in the left half plane ReA<0. 


Proof. It may be established by induction that 


= i 
=? + aby 4 a; bp = 
2 Athy 


det(AI—G 
p(a) = eH 
IT (A+he) 
k=1 
As the h,>0 it is clearly sufficient to show that #(A)=0 for ReAZo. Let 
A=o-+it. Then 


Rep les ot tea b p, clothe) +? 
foe ep(s) =o T* + ay ot Dim (otha)? +e 


Im p(A) =207+T7 >’ a,b, — Wh 
h=1 


(oFhy) Eo 


From (4.25) it is easily seen that Re (A) and Im #(A) cannot vanish simultane- 
ously for o=0 if either (1.17) or (1.18) is satisfied, which completes the proof. 


Remark 4.1. That more than one y type component cannot appear in (1.16) 
without changing the conclusion of Theorem 3 is shown by: 


Let f(x) satisfy (4.5, 1.6, 1.7). Then there exist solutions of 
Oe ME ety payee =f (4) FS pKa) ais SS ai (e) 


which do not tend to zero as tox. 


Proof. Let x(t), €(t), Z(t) be a solution of 
t=—E—2, E=2f(x), #=—24+f(x). 


Then by Theorem 3 (hypothesis (1.18)) 


s easily shown to be a solution of (4.26), and as 


lim (x*(), yi (2), v2 (4), 2*(4)) = (0, — 1,1, 0) 


t—> oo 
the proof is complete. 


A little reflection shows that the key to the preceding example is the existence 
of the critical points x=0, ¥,= +YVk, yo= = VR on 


g(x) +e vi +32 =k>0, 
which is an integral for the system 


4=—¥— Ye, W=t(*), Ye=T(*)- 
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5. Applications to Reactor Dynamics 
In this section Theorem 2 with »=1 (which is essentially Theorem 1) and 
Theorem 3 are applied to certain problems in reactor dynamics. For earlier 
work in this direction as well as some references to the physical literature see [5]. 
The dynamic equations for a class of homogeneous nuclear reactors are of 
the form 


(5.4) (Seal, STS =TEe TT Se. 

In the case of NEwron’s law of cooling studied previously [3] one has 
(5.2) h(t,u, T\=1, f@)=explul—17) 4) =0, 

with the physical interpretations 


u = logarithm of the ratio of the reactor power to the stationary power 
(w=0 at equilibrium), 
T = deviation of the temperature from the equilibrium temperature 
(5.3) (T =0 at equilibrium), 
€ = heat capacity, 
—a= ratio of the temperature coefficient of reactivity to the mean life 
of neutrons. 


In (5.1) is a heat conduction term and ¢ is a forcing term. 


An immediate consequence of Theorem 2 (1=1) is 


Theorem 4. Let the functions h, f, e satisfy the hypothesis of Theorem 1, and 
let the constants w, € satisfy e>0, a>0. Then given any uy, Ty, the solution u(t), 
T(t) of (5.1) satisfying u(0)=u, T(0)=Ty exists on OSt<co, and 


(5.4) lim w(t) =0, lim T(t) = 


t—> co t—> oo 

It may be noted that Theorem 4 restricted to (5.1, 5.2) generalizes Theorem 3.1 
of [5] in that under the same hypothesis it establishes global rather than local 
asymptotic stability. Further, while the earlier theorem asserted local asymptotic 
stability, the proof was quite incomplete and all that was really established was 
ordinary local stability. Comparing Theorem 4 itself with (5.2), one notes that 
the only relevant property of f=exp[w]—1 is that it represents a “‘spring”’ in 
the usual sense of (1.5, 1.6, 1.7), and that of h=1 is that it represents a ‘‘positive 
damping”’ in the sense of (1.3, 1.4). 

The dynamic equations for a class of heterogeneous reactors of m (m= 2) 
media are of the form 


(5.5) ; a 
eee B) 2X; i(1;— T) +7, fw) + 4; (0) (= 4, 0245.70), 
where T=col(7Z,,..., J},). The special case studied in [5] had 


(5-6) A(t, w, T) = 15) f(a) = exp) ae =o Cea an, 
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with the physical interpretations 


u = logarithm of the ratio of the reactor power to the stationary power 
(uw =0 at equilibrium), 
T; = deviation of the temperature from the equilibrium temperature in 
the 7" medium (T7;=0 at equilibrium), 
€; = heat capacity in the 7* medium, 
(5.7) —a,;= ratio of the temperature coefficient of reactivity in the 7 medium 
to the mean life of neutrons, 
n;— fraction of the power generated in the 7° medium (physically 
Ln=1), 
= thermal conductivity from medium 7 to medium 7 (physically 
X;;=X;,). 


X;; 
In (5.5) his a heat conduction term and the e,; are forcing terms. 


In order to apply Theorem 3 to (5.5) (it will shortly be seen that Theorem 2 
is not applicable here), it is first necessary to transform it to the form (1.16). 
The notation 


CaO (Ory comin ee eg COLI yas Md) ye eC COLC , . 1y Gon) 
é=diag(s,...,&,), jm’ = transpose of wu (for any mu), 


will be used. From (5.5) it is clear that 


X,;} T,+ h(t, u, T) > X;,7T;, +7; (uv) + (2) 


e; T= — h(t, u, T) | 


5.8 7 jt 

ee) (VAR a). 

Define the matrix A =(A,,) by 

(5.9) 18 ee Ay eG! a(t) 
i+t 


Then from (5.8, 5.9) one sees that (5.5) may be written as 


“u=—T'o, 
(5.10) : 
eT=—hit,u,T)AT+7f(u) +e(é). 
We need the following algebraic 
Lemma 5.1. Let 
(5.11) X,,;=X;;>0 (+7), 6 > Onl See 


Then (i) the real matrix A defined by (5.9) 1s symmetric, and precisely one of its 
characteristic values is zero, while the remaining m—1 characteristic values are 
positive; (ii) there exists a real nonsingular matrix R such that 


(5.12) ReR=I, R'AR=D=diag(d,...,d,), 


where I is the unit m by m matrix. Furthermore, precisely one of the d; 1s zero, 
say d,=0, while the rest are positive. 
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Proof. (i) That A is symmetric is obvious from the definition (5.9) and the 
first half of (5. 11). Consider the quadratic form 


(5.13) Wu) =e Ap. 
A direct calculation yields 
wt mM 
Wu) => 2d Xij(Mi— 4)”, 
t=1 j=i+1 


from which, using the first half of (5.11), it is evident that W() is positive for 
all « except multiples of col(1,...,1), for which it vanishes. The statement 
about the characteristic values of A now follows from well-known facts about 
real symmetric matrices. 

(ii) By a well-known theorem, see [6, p. 187], (5.12) is an immediate conse- 
quence of the positive definite real symmetric nature of ¢ and the symmetry 
of the real matrix A. Making the change of variables 1 = R& in (5.13) and using 
the second half of (5.12), one has 


W( w= W(RE) =e RA RE= 54, &, 
i=l 
which together with (i) easily yields the statement about the d;. 
Letting T=RQ in (5.10) yields 


U=— Oa 


(5.14) q 

Q=—hit,u,R Q)DQ+ bf (u) + é (i) 
where 
(5715) @=R' ar be Rag Seog= kK ee, 


which written in component form is 


4=—a,0;—), 40, 
kaj 


(5.16) Q; = 0; f(w) + @; (2) 
Q;=— h(t, u, R Q) d;Q; + 0; f(u) + é; (2) (2 +59). 


Under hypothesis (5.11) the system (5.14), or equivalently (5.16), has the same 
stability properties as (5.10) since R is a nonsingular constant matrix. It is 
immediately seen that (5.16) is of the same form as (1.16). Therefore, we have 
as a consequence of Theorem 3. 


Theorem 5. Let the functions h, f, e; be sufficiently smooth for a local existence 
and uniqueness theorem to hold for (5.5); the h, f, e; satisfy the hypothesis of Theorem ; 
the X;, and e, satisfy (5.11); R be as defined in Lemma 5.1; and a, b be as defined 
by (5.15). Furthermore, let a, b satisfy either 


(5.17) a=cb, where c>0, and b is arbitrary except that b;==0 and at least one 
other b; 1s not zero, 
or 


(5.18) a;/b;>0 (¢=0,1,...,m). 
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Then given any uy, To, the solution u(t), T(t) of (5.5) satisfying u(0) =u», T(0)= Ty 
exists on ONt<oo, and 
(5.19) lin 7) =0, ~~ “lim 7 (7): = 0: 


t—> oo t—>0o 


The remarks following Theorem 4 apply equally well here to the comparison 
of Theorem 5 with Theorem 4.2 of [5]. The verification of hypothesis (5.17) 
or (5.18) is a purely algebraic matter and does not involve solving any differential 
equations. R is determined by the given matrices A and ¢; and then a, b are 
defined by (5.15) in terms of the given vectors «, 7. Hypothesis (5.17) is analogous 
to the crucial hypothesis, (2.6) of [7], used in the analysis of the integrodifferential 
equations for a certain continuous medium reactor. 


We note that if 
(5.20) a=cn (c>O0), 
then from (5.15) it follows immediately that 
(5.21) a=cb (c= 0). 


Under condition (5.21) one obtains (this is the first third of the proof of Theorem 3) 
not (5.19) but merely that w(¢), Z(t) exist on O<¢<oo and are bounded there. 
However, the set of vectors in 6 space excluded by condition (5.17) is easily 
seen to be of Lebesgue measure zero. Hence, using the nonsingularity of R, 
one can give the following modified form of Theorem 5: Jf (5.20) 7s satisfred, 
then for almost all y (5.19) holds, and on the exceptional n set u(t), T(t) are bounded. 
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Abstract 


Asymptotic solutions of linear systems of ordinary differential equations are 
employed to discuss the relationship of the solution of a certain “‘complete”’ boundary 
problem. 


ax 

Bh = Aya (t, 6) a(t, 8) +0 + Aiplt, 6) %p(6 8) 
hy 4X2 
Mt 2 = Aas (i, €) a (t,0) + -++ + Aaplt€) X(t, €) 
hy dx, 
ae =Apy(t, €) % (4,6) + +°+ +Apol(t, €) xp (it, €) 


R(e) x(a, €) 4+ S(e) x (b, &) =c(e) 


as é—>0F and the related ‘‘degenerate’’ problem obtained by setting e=0. Here the 
h; are integers, O<hy<++:<hpy, x; is a vector of dimension 7;, A;;(t, €) are matrices 
. . . ay 
of appropriate orders with asymptotic expansions, x is the vector {| : ], R and S are 
- : 
x 
square matrices of order >) n; and «> 0. Z 
eal 
It is shown that under certain conditions the solution of the ‘ ‘complete’? problem 
as ¢->0* approaches a solution of the ‘‘degenerate” differential system and satisfies 
”, appropriate ‘degenerate’ boundary conditions. 
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1. Introduction 


We are concerned with showing the relationship of the solution of a boundary 
problem (1.1), (1.2) as e->0* to the solutions of a related degenerate problem 
(1.3), (4.4). The problems are 


£5 (t, €) = Aral, ©) % (6) +--+ Arp lé 2) ple, @) 
on cn ge talts€) = Anall 8) a1(6,6) +--+ Aap, 8) a9 (6 0) 
ete © 5, (t, 6) = Apilt, €) ma(t, €) +--+ Applt, 2) Xplt, 2), 
(1.2) R(e) x(a, €) + S(e) x(b, 8) =cle), 
i i) ee we AO 
is 0= Ag, (t, 0) 4 Asp (t, 0) %5 
Gat 0) Age (t, 0) Xp, 
(1.4) R(O) x(a) + S(0) x(6) =c(0), 


where the h; are integers, O</g</g<---</h,=h, x; is a vector of dimension 
b 


n;, m= >\n;, A(t, €);; are matrices of appropriate orders with asymptotic ex- 


I 
j=2 Lay 
pansions, x is the vector ( ) R and S are square matrices of order (m,+m) 
and e>0. Xp 

Under three hypotheses, H 1, H 2, and H 3, we shall prove Theorem 1 (Sec- 
tion 6), which embodies our results for the problem indicated above. We begin 
by reducing the problem (1.1), (1.2) to a canonical form (2.15), (6.3). We show 
that the solution of the canonical boundary problem has a limit as e—+0* which 
satisfies the corresponding degenerate differential system and , of the degenerate 
boundary conditions. A discussion of the hypotheses is given in Section 7. 

The author wishes to express his deep appreciation to Professor H. L. TuRRIT- 
TIN for his guidance and encouragement during the preparation of this paper. 


2. Preliminary Transformations 
If A,,(¢, 0) is non-singular, a<?<8, the last equation of (1.3) may be solved 
for x, in terms of *,,..., %»-; and substituted into the preceeding equations of 
(1.3) to give a system of the same form as (1.3) in %1,..., %»-,. The last equation 
of this form is 


(Agen — Agar ppp Aes ict + (Apeugpaa a Apap App Ap pi) % a: 


Thus, if (Agee gg eA 1) is non-singular, a<¢?<b, we may solve 
this equation for x,_, in terms of %1,..., %»-2 and repeat the process until we 
solve the first equation for x,, which is a differential equation. If this process 
can be carried out completely, step by step, and %,, %»_1, ---, %» all eliminated, 
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then E. R. Rane [8] has shown there exists a non-singular transformation, 
OXeSe, aStSd, of the form 


(21) x(t) =(L) +e) (BO +2 O21) (ZO +e A) v@ 2) 


which changes the differential system (1.1) into the form (2.2) with corresponding 
degenerate differential system (2.3). 


ai y(t, €) = &) Vi Eee hy Crp lt, é) p(t, é) 


e +i Ya(t, €) = Cor(t, €) W(t, €) + Cop (t, €) Yp(t, €) 


(22) : 
el 5 Vp (t, é) = Corll, é) y(t, é) aa Coplt, é) Vp (t, é) 2 
4 y= Ci) mt + Cpl 0) 9, 
0= 63,6 0% 7-2 Czy 
(2.3) 3 - ) Vt 2p ) Vp 
O= Cy, (4,0) +--+ Cop(Z, 0) Vp 
where C;,(¢, 0) is non-singular a<i<b, j=2,..., #, and in particular 


y, -1 
Crp lt, 0) =A,,(, 0) and Crd pt Aga gr Ape PA 4s etc. 


Further, L may be chosen so large that the elements of C;;(f, €) are O(e*) for 
any particular given large integer «>0, 7=-7. We shall have occasion to assume 
that this has been done. 


For an N® order differential system of the form 


oe eri [dal ei) z 


the most general asymptotic expansions of solutions have been given by 
H. L. Turrittin [9]. In particular, he has given sufficient conditions for the 
existence of a transformation z= H(t, ¢) w(t, e), where 


K 
(2.5) HG, aha etl Ae), vy and K suitable positive integers, 
k=0 


which will transform the equation (2.4) into a new equation 
(2.6) of MY (5, A) (6, €)) + Bit, &)} wl, «) 


where B(i, )=O(1) and the characteristic polynomials A, (t, €) have the form 
rB—1 


(2.7) Aj(t,e) = Di eA), 7=4,2,..,N 
and A,(¢, €) =A,(t, €), or A,(t, €) A, (t, 8), aS<tSb, O<eXeH. 


If y>1, the problem Petia have been adjusted in the beginning by introduction 


of a new parameter e””—¢,. We assume that this has been done, so that without 
loss of generality we may assume 7=1. 
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Further, if the characteristic polynomials /;(¢, ) are such that 
Re{e"A,(, e)}S---SRefe“Ay(t,e)}, for a<t<b and 0<eSe, 


there exists a fundamental matrix solution of (2.6) of the form Wé(t,«) = 
Fit, e) E(t,«), where when aX tb and 0<eXuy 


Pe ies ae Ne. 
: Omen 
ENG e : } Ede we) — We tle? : 


dare vey 


and F,,~ 1 Fix . Here B;; a if 1-7; B,;=0; and F,,9(¢) is non-singular 
=0 

when pee. Also E;=I; Bike PE da (a, €) do}, where J, is an identity matrix, 

and 4,, are the distinct ohavactenstic polynomials. 


Thus a transformation of the type 


ian ate 
(2.8) Wy (One) @ Malt e) z(t, €) 
0 ete ae (2, €) 


will change (2.2) into the form 


ate = C11 (4, €) 4+ {Grolt, €) H(t, €)} % ary + {Cpl ea e)}%p 


d ‘ a i 

5 gus = tH, Cai} 4+ Ha" {Cay Hy — 0 5, Hal aa +--+ (Ha" Cap Hy} % 
Oa: : ) : 
de 5? = {H5" Cys} 4 + Wy aa Hy'{C, pH, — ete Hs} Zp 


Thus, if TuRRITTIN’s results apply to the individual equations, 
be Oh : 
(2.10) ae y;(t, €) =C,,(t, 2) yj(8),  7=2,...,8, 
as we shall assume, the transformations H,(?, €) may be chosen so that 


= ae 
H;*(t, e) {ii (t, ©) H(t, e) — © Hy lt e)} 
has the canonical form shown in (2.6). Also in transformation (2.1) L may be 
chosen such that the elements of H;*(t, e) C;,(t, e) H(t, €) are O (el) as e>0*. 
We may assemble all the successive transformations that have been made into 
a single transformation y(t, )=H(t, €) z(t, e), where 


df 
(2.14) H(t, e) => H,(t)e, | J a suitable large integer, 
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in such a manner that this transformation will change (1.1) into 


aa = Dy lt, é) ay a Dy o(t, é) 49, 
(2.12) 


d 
é ae = Dey (t, &) 4 + Dealt, €) 22, 


where = 
Dyo(t,e)=Ole),  Der(t,e)=Ol(e"), Dealt, 2) = (6;;4;(4, €)) + 2" Das, 


Do(t, €)=O(1), and A,(t, €), 7=1,...,m, are the non-vanishing characteristic 
polynomials associated with the differential systems (2.10); and where moreover 
by hypothesis 


dy = Ag So eg, Eg St Sg dg Fy at Bt Ay 


It is advantageous to make one more transformation on the system (2.12), namely 


Py rr QujnneO 
O Py 

(2.13) 4=Pi)m(Le), %=] . se y(t, €) = A(t) ue, 
Oct wee 


where F, ; (¢) is a square matrix whose order is the multiplicity of the characteristic 
polynomial /,,(¢, e). Thus (2.12) becomes 


fa Bo (0) {Parlt,€) Ble) — GP f a + {EME Dr alt, &) Pal} mp 
eh 8 ={P*() Darlts€) RO} + BM) [Daolt. ©) RO — FE BO} we. 


Choose A(t) such that BS TEA 0) A(t), P(a)=1,, and F,;(¢) such that 


d — a . ae 
Gz Fai) = (Dart, 0));;4;(4, B;(a)=1,, where (Dyo(é, 0));; is the 77 block of 


the matrix D,, partitioned in the same manner as P,. 


7 


Thus the transformation 


(2.14) yee He) Gs ie bse) 


will change (1.1) into the canonical differential system 


d 
me (t, €) = By (t, €) w(t, €) + Byo(t, €) u(t, é) 


(2115) oe 
or 7g val, €) = Ba, (t, €) w(t, €) + Boo(t, €) ua(t, €) 


with a related canonical degenerate differential system 


(2.16) 


where «,(/) are the characteristic roots of the matrices C;,(¢, 0) of the equation 
(2.2), all of which were assumed to be non-zero on a<t< b, which implies in 
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turn that the solution of the canonical degenerate differential system is u,=c,, 
u,—=0, where c, is an arbitrary constant vector of appropriate order. 

Further, this conversion of the differential system (1.1) into (2.15) is such 
that a fundamental matrix solution W(t, ¢) for the system (2.15), when TURRITTIN’S 
results apply, has the form 


QA7) Wty) = (LF) Ee). 


Since we are treating the boundary problem as a whole, the transformation 
(2.14) will affect the boundary form (1.2) as well. This effect will be considered 
in more detail in Section 4. 


3. The Canonical Problem 

We make the following hypothesis. 

HI: (1) The matrices A;;(t, e) indicated in (1.1) have asymptotic expansions 
of appropriate high finite orders. 

(11) The matrices A,»(t, 0), Ap—s,p—-1(¢, 0) —Ap_a p(t, 0) App, 0) Ay pr @ 0) 
and similar matrices referred to in Section 2 are non-singular on the interval aStSb. 

(iit) There exists a non-singular transformation x(t, e)=H(t, e) u(t, e), where 
H(t, )=Hy(t)+eHy()+---  Hy(t), a<tSb, OS eX eq, which will convert (4.1) 
into the canonical form (2.15). 


(7v) The m non-vanishing characteristic polynomials 2, (t, : satisfy 
Re{e~ A, (t, e)} S$ Re{e Ag (t, e)} S++. S Re{e A, (t, €)}, 
GSTS O78 ORS 


If X (¢, e) is any fundamental matrix for a system of differential equations of 
the form See é) x(t, ¢€), €>0, aStSb, then any particular vector solution 
L(t, ©) must be of the form /(t, e)=X(é, e) l(e). Thus, if J(é, e) is to satisfy the 
boundary conditions F (¢) l(a, jes ) L(0, patton ), we must have {R (e) X(a, e) + 
S(e) X(b, €)}U(e)=c(e). Thus, if Ale ={R (ce) X(a, e)+ S(e) X(b, e)}, L(t, e) will be 


unique if A(e) is non-singular, for in eens event 
(3.4) Ki, &) = X(é, 8) A (e).c(e). 
The limit problem is then the computation of the 

jim /(¢, é) = lim X(t, Se) Am(ene(e):. 


To evaluate this limit we need more detailed information about the structure 
of AV (e). 
4, A-1(e) for a Canonical Problem 
We assume that we are dealing with the canonical differential system (2.15) 


which has been obtained from (1.1) by the transformation x (/, e) =H (i, €) u(¢, €) 
of H1—(iii). This transformation will change the boundary conditions from 


(1.2) into 
(4.1) M(e) u(a, €) + N(e) u(d, €) =c(e) 


* [p(t)] represents a function p(t, e)= p(t) + (4, ¢)&, y>O, |p, (¢, €)| <B. 
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where 2 sad 
M(e)=R(e) H(a,e) and N(e)=S(e) (0, €). 
We make the following hypothesis. 


H 2: (i) R(e)=Ry+eRy(e), S(e)=Sot+ eS, (e) where the elements of Ry (e) and 
Si (e) are bounded for 0S eS eq and the rank of (R(e):S(e)) =m +m for OS eS é. 


(ii) The non-vanishing characteristic polynomials have non-zero veal parts, 1.€., 


Refe* A(t, e)} SS Refe“ A, e)}<0<Re{e“"4,4,(,)}S---S Re{e"2,,} 
on ast<b, 0.< 66: 


If we choose for the fundamental matrix'the one indicated in (2.17), we have 


: Baber 
@=(me i+ (™  & »)p 
If D(e) =det A(e) =O O<eSXéq, A(e) will be non-singular. We have 


D(e) wd, Sy (€) 6%, 
where 


(i) % covers some finite range, 

(ii) w,(€) are distinct quantities, each of which is of the form 5 On, (2, €), 
wnerel, |=, 452.5, 77: ;J2fand (ko, ky,---, 2) IS any permutation of ole ere 
0o(2, €) =0, 0; (0,8) = ct Fay o,é) do, 721, 


(iii) the coefficient ae ay, (€) 20, 2, (e) =O (1) as e> 0". 
For a discussion of the zeros of such exponential sums, see TURRITTIN [10]. 


All terms indicated in (11) need not be present; however, one is of particular 


interest, namely the term a(e) e®) where w(e)= >) 0;(0, e). (We note that, 
j=k+1 

if k=m, then w(e)=0.) An explicit expression for the leading term of the coef- 

ficient function (e) can be given as follows. Let the columns of R(0) H(z, &) 

and S(0) H(d,e) be a1, #9 respectively, 7.e., R(0) H(a, e)=(%4:-°: cabs 

S(0) H(0, €) = (12: +++ %,4m,2) and let Q(e) be the 7,+m"™ order square matrix 


Q = (4 + a9: +: Ope a Fi no) Raa te Me ee 


(4.2) foam 
*Onthi 1: MM +tet1,2°°°°* CORES 


The leading term of a(e) is the determinant of Q(e). In general for ¢ sufficiently 
small it can be shown that if Q(e) is non-singular (see for instance HARRIS [5], 
page 88), 
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where 
b 
Pre ae (6, expe“ f ee ath} 
= (6;;exp{ons4;(b,2)}), 7=1,2,...,.m—R, 

and hence 

ee T6 0 
(4.3) fed (3 ee ES ae ) ([Q (e)]) : 

OO E26, 2) 


It will now be shown that the computation of the limit of (3.1) as e+0*, @.e., 
the evaluation of lim, X(¢, é) A1(e) c(e), is essentially the question of the limit 


as ¢—>0* of the first 1, rows of the matrix 21 (¢). To establish the nature of the 

elements in the first 1, rows of 24(e) we must examine the transformation H(é, e) 

of Hypothesis H 1—(iii). H(é, e) is a product of four transformations, (2.1), (2.8), 

(2.13) and * = ) where J is a constant matrix which renumbered the last m 
0 

unknowns so that the characteristic polynomials are ordered by their real parts 

as shown in H 1—(i1v). 


where the multiplication indicated by [] is a polynomial in ¢ with a non-singular 
matrix for coefficient of ¢«®. This transformation does not change the essential 
character of the boundary form R(e) x(a, e)+ S(e) x(d, €)=c(e), since 


([Rol) 


=e) 
& 
! 
a 
‘Ss 
Il al 
J 
& 
| 
2, 
Ss 
S 
! 
B 
i) 
B 
S 
} 


where 


Let us consider the effect of the next transformation on R(e). Partition R(e) 
into blocks of columns containing , % ,...,%,, columns respectively; 1.e., 


Ret ReR, #2 OR,)a Tis 


te 0 

= 0 : = fits = 

Re) {2 2g | = (Ri: Re Mala 8): Ry Hp (a, 8), 
OREUDL ORT} 


and the matrix H;(a, ¢) affects only the matrix R;(e). Let us consider one such 

product R(e) H(a, e), momentarily dropping the subscript 7. H(qa, «) is a finite 

product of two types of transformations % (a, «) and -4@(«); more precisely (see 
Arch, Rational Mech. Anal., Vol. 5 15 
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TuRRITTIN [9], page 96), 
H(t, 0) = Lilt, ) Mle) Alte) Mle)... Flt, 2) Ge), 
L,(t, €) =Ljo(t) + eL;,(t) +e ei Lp,(t) , Ljo(t) non-singular on a@S/S aa 


and 
M,(e) = (Sys ete), 


] 


where m is the order of H(t, «), k;, and k;, positive integers, and 


Al, if 0<k, SkSkj, Sn 
u — . 
ree 0, otherwise. 


For example, a typical -@(e) would be 


4) oe 0) 
Ov. 
1 
Oe 
ot 
ée O 
4 
= 
6) ae Owed 


Consider R(e) Gla, e). Since A(a,e) and R(e) have the same essential 
features, so does R(e) G(a, «), i.e. R(e) G(a, e) = ([R(0) Lio (a)]). Multiplication 
of R(e) Gla, e) on the right by (2) multiplies some columns at the beginning 
and end by unity and every other column by a different power of e. For definite- 
ness, let the highest power of ¢ be J. Multiplication of R(e) Z(a, ce) M(e) on the 
right by ¥,(a, €) replaces each column by a linear combination of the columns 
of RY, and each column of R Y.M, has ¢° for the lowest possible power of 
é in the columns. Thus, if 2,-+-%—™m columns of constants were adjoined to 
the matrix R(e) G(a, 2) M(e) Ala, e) and the determinant of this augmented 


matrix were evaluated, the lowest power of ¢ that could occur in the expansion 
Aa 
of this determinant would be ¢ 2 and not ¢® due to certain column dependence 


of the matrix R(e) GA(a, e) Ge) Ala, e). Multiplication by the remaining & 
and .@ transformations and a similar argument gives rise to the following state- 
ment. Let 


(4.4) det H,(t,e)=«"[h,(t)],  h,(t)0, a<t<bd. 


Any matrix with ,-+-m—n, columns of constants adjoined to the »,; columns 
of R;(e) H;(a, €), i=2,...,, will have a determinant which, when expanded in 
powers of «, will begin with the term in ¢ or possibly some higher power of e. 
Similar results apply to the matrix S;(e) H,(b, ), +=2,..., p. Further, if we 
select any k, columns from R;(e) H; (@, €), SAY ,@y,...,a,,, and m;—k, columns 
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prom S;(6) 4.(6; 2), Sayidy aig +, @q,such that (a; ss; a,,) 1S any permutation 


of (1, 2,...,”,) and adjoin ,-+m—~n; columns of constants, the resulting matrix 
often has a determinant for which the lowest order of ¢ that occurs is ¢%. The 


last two transformations are of the type already considered and do not change 
the previous statement. 


Let us partition M, N, and Q in the following manner: 


me =(50 ey, me) =( oa) a=(0 pol 
M2, Mp» Noy Np» Qo, Qo» 


where M,,, N,, and 9,, have order m, and M,., Nos, Qe2 have order m. We 
see that 


M, , (0) a5 N, 1 (0) = 22, , (0) 


(4.5) M1 (0) + N31 (0) = 22; (0), 


7.e., the elements in the first ”, columns of 2 are O(1) as e—0*. Thus, often 


det (2(¢) =O (e") where restate (see 4.4), and the cofactor of any element in 
the first 2, columns of 2 is O (et). 
We make the hypothesis, 
H 3: The matrix Q(e) as given in (4.2) satisfies the conditions: 
(2) Qe) ts non-singular, O<éeSép, 
(12) the elements in the first ny rows of Q4(e) ave O(1) as e>0*. 


Hypothesis H 3—(i) assures us that 44(e) has the form shown in (4.3). 


5. Evaluation of lim X(é, e) A-!(e) e(e) 


Combining (2.17) and (4.3), we have the following representation of the unique 
solution of (2.15) and (4.1): 


I(t, e) = X(t, 2) A4(e) c(e) 


it! 0 PEO 0 
== ((2}) |) 0 2,2.) 0 Oued ) ([24 (e)]) c(s) 
0) Oreste euce) Om ere = bice) 
in 0 0 
=({1]){ 0 £,(@2) 0) ((24(e)]) c(e). 
0) 0 erty é) VD sees (ey é) 
We have 
fen (t6) = (6, exp{e" f a;(o, e) ao}), an pre eas 
and 


b 
oy 2) Lm 40, ©) = (0, exp{— a” f Ase (0, €) do}), 4 =A, 2,222, a hy 
t 


45* 
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and hence * 
E,(t, €) >Og, a<OStsb, 
Bot eat ito ee t= 0, = 0, 


exponentially fast due to H 2—(ii), and uniformly in ¢ for the indicated intervals. 
Thus 


(5.1) im X(6,e)4-¥() o(e) =( 20 oes 


= (3 ax 0)3126,—05 
where the vector J, =,,(0) ¢ (0) +, (0) C(O), and 
OA (eee ee @12(2) if Q,,(e) of order n,. 


(We note that, if k=m, b may be included, and if k=O, a may be included.) 
It is clear that the limiting constant vector / 


(5.2) 1= lim 1(t, e) = (ae (0) + 2120) “ee = & 
sate ‘ : 


is defined for the interval a<i<b and as a function of ¢ is a solution of the 
degenerate differential system (2.16). We shall now show that this lmiting 
solution / satisfies n, degenerate boundary conditions. 


6. Boundary Conditions Satisfied by the Limiting Solution 


Multiplication of the boundary form (4.1) on the left by any non-singular 
matrix of constants will give rise to an equivalent boundary form. We have 
partitioned 27 (¢) and Q(e) as follows: 


hence 


Q) 1 (0) 211 (0) +.Q12(0) 22:(0) =I, 


and the matrix (Q,,(0):Q,.(0)) has rank m,. Using (4.5), we have 


(6.1) 211 (0) (Mix (0) + Nii (0) +. 2(0) (Mai (0) + Noi (0)) = Jy, 
Further, there exists constant matrices , and F,, such that the matrix 
(6.2) R= 2,1 (0) Q,5(0) 

Bm, Fe 


is non-singular. 


* The symbol O, here represents the zero Square matrix of order R. 
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Let us replace (4.1) by the equivalent boundary form 
(6.3) M(e) u(a, ©) + N(e) u(b, e) =Z(e), 
where M(e)=FM(e), N(s) =F N(e), and C(e)=Fc(e). The degenerate bound- 
ary form corresponding to (6.3) is 
(6.4) M(0) u(a) + N(0) u(b) =Z(0), 


where 


ito) =( se and Ho=(a0 Pee 


Substituting the limiting solution / into the left-hand side of (6.4), we get 
S40 oh M, N 
mrey(5)+¥0(5)= (Go) + Roda) 

0 (0 0) + .N,,(0)}2, 


By direct computation we have 


and hence by (6.1) 


Further, 


so 
C1 (0) = 24, (0) ey (0) +2, 2(0) co (0) = 4. 

Thus, the limiting solution / satisfies the first n, degenerate boundary conditions 

of (6.4) corresponding to the boundary form (6.3). 

Without loss of generality we may assume that the boundary form (1.2) has 
been replaced by the equivalent one obtained by multiplication on the left by 
F as given in (6.2). Further, the solution of the canonical problem will provide 
the solution of the original problem through the transformation (2.14). 


Pe 

ee aioe), 

0 AOL 
The transformation H(t, e) was defined for aXi<b, OSeSéq, and lim, H(t, é)= 
H(t, 0) exists, a<¢<b. Thus the limiting solution for the problem (1.1), (1.2) 
will be 


(6.6) y(t) = H(t,0) ( 


(6.5) y (tye) = HG, ) | 


A) 0 i Gato; 
0 BWR 


Theorem 1. Under hypotheses H 1, H 2, H 3, the two-point boundary problem 
(1.1), (1.2) has a unique solution y(t, €) on aStSb, O<eSéq, such that the 
lim y(t, €)= y(t) exists on the open interval a<xt<b, and uniformly on any closed 
e—>0t 
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sub-interval aX 6,<t<6<b. The function y(t) satisfies the degenerate differential 
system (1.3). The limits, y(a+-0) and y(b—0) exist and satisfy the first n, boundary 
conditions of the degenerate boundary form (1.4). 


7. Remarks 


(1) H1 is a condition of regularity imposed on the differential system to 
ensure that solutions to the degenerate differential system exist and that 
asymptotic solutions exist for small ¢ and ¢ in the interval axtxb. H2isa 
regularity condition on the boundary form and a restriction on the characteristic 
polynomials /,(¢, €) which allows us to determine the essential character of the 
matrix A(e). H 3 defines a “‘regular” problem and assures the existence of the 
limiting solution. H 3 could be replaced by the assumption that the coefficient 
function a(e) for which det Q(«) is the leading term is #0. a(e) is the deter- 
minant of some matrix We). If We) is non-singular, 0< ¢< é9, A(e) has essential- 
ly the same form as before, and the limiting solution will exist only if the first 
n, rows of S/1(e) are O(1) as e—0*. 

Elementary considerations show that Q is singular for the following example, 
but the limiting solution exists, satisfies the degenerate differential system and 
the first boundary condition: 


dan 


= 8 %,(t, &) + X(t, €) 


and 


be d (0,0) + (1 zi x(1,0) =(%). 


Here we have 


and 


However, if we consider the same differential system with the boundary conditions 


10\_ 240 
i, 5) 209+ fs i) athe=e 


o=()'), a(s)== [1], 


and we see that the lim, X(t, €) A14(e)c does not exist. 

(2) The results of this paper are most closely related to the work of G. G. 
CaN Jr. [2], W. R. Wasow [17], and I. S. GrapsTEIN [4], who consider single 
N™ order differential equations. CHAPIN and Wasow consider the case where « 
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conditions are specified at one point and N—« at another point. CHAPIN’s 
differential equation is more general than that of Wasow, but his assumptions 
on the characteristic roots (when his problem is restricted to that of Wasow) 
are more stringent than Wasow’s. GRADSTEIN considers an initial value problem 
which is essentially contained in Theorem 1 of this paper. For our treatment of 
the initial value problem, we set R=J, S=0, and the requirement that Q be 
non-singular (or A (e)) implies that all the characteristic polynomials have negative 
real parts. A similar treatment of this type of initial value problem for non-linear 
systems is given by J. J. Levin & N. LEvinson [7] without recourse to asymptotic 
expansions. 


This research was supported in part by the United States Army, contract No. 
DA-11-022-ORD-2042. 
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Uniqueness of Radiative Solutions 
to the Schroedinger Wave Equation 


C. ZEMACH & F. ODEH 


Communicated by W. MAGNUS 


1. Introduction 


This paper is primarily concerned with the uniqueness question for the exterior 
Dirichlet or Neumann problem associated with the Schroedinger wave equation 
with a suitably restricted real potential V(r): 


[V2 + k? — Vir)] y(r) =0, n=O. (1.1) 


The domain of definition, D, of y consists of all the points in a three-dimensional 
space which lie outside a closed bounded surface 2. It is assumed that 2 is 
regular and y is regular *. 

It is well known that the solution for the same problem associated with the 
reduced wave equation (V?+k?)y=0, which obeys SOMMERFELD’s “‘radiation” 
condition 


2 
Jim. { = et iky| dA=0 (1.2) 
ZR 

where 2p is the surface of a sphere of radius R, dA is the element of area of L’p 
and 7 is the polar coordinate, is uniquely determined by the Dirichlet or Neumann 
data on the surface 2. This was rigorously proved by FREUDENTHAL [2] (for the 
two-dimensional case) and others; then by Retiicu [4] for any number of 
dimensions. Later MIRANKER [5] considered the case of the Schroedinger wave 
equation with a locally integrable potential which dies off at infinity as 7“, 
> 3, and proved that zero Dirichlet (or Neumann) data on the surface 2» imply 
that a bounded solution vanishes identically outside some sphere. This was 
sufficient to prove uniqueness when the potential is bounded. 


Our main results are: 


(1) under suitable restrictions (less strict than MIRANKER’s) on V (see Section 2), 
there does exist a uniqueness theorem for the problem at hand. 


(1) We also prove, incidentally, that the radiation condition implies the 
boundedness condition: y(r) =O (r+), for the solutions considered. Our method 


of proving this depends on an iteration procedure which resembles the one used 
by ZEMACH & KLEIN [6]. 


w Regularity of surfaces or functions signifies that the application of GREEN’s 
theorem is permissible. A sufficient condition on y is that it be absolutely continuous 
[1]. The characterization of a regular surface is given by KELLOGG [3]. 
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Similar results, under different conditions on V, have been obtained inde- 
pendently by Karo [7] by the use of more complicated methods. 

Finally, we remark that these considerations fail when k=O, and we give a 
counterexample in this case. 


2. Properties of the Potential 
We adopt the notation 


I(r, 2) =f We gee 


[r—s| 

where tT is a subregion of D and define 
|= 22) Max 10), (2.1) 
le = (222) Max I(r, 7). (2.2) 


The considerations of this paper apply to all potentials V(s) which obey the 
restrictions: 


() ||VIl<~, (2.3) 
(wz) I(r) +0 as 47>, (2.3 b) 
(a2) I(1) is continuous in 7, Sic) 
(wv) V(r) is bounded at infinity, (2.3 d) 


i.e., there exists constants M and S such that |V(r)| <M for 72S. 
If V(r) is bounded everywhere, then (77) is satisfied trivially. These hypoth- 
eses embrace potentials of order O(vy~?~*) at infinity and include others of less 
regular asymptotic behavior. It is clear that V(1) must be locally integrable. 
Condition (iv) is used only in Lemma 5 of the next section to establish boundedness 
of solutions to (1.1). It is unnecessary if boundedness is assumed in advance. 
We take note of a few inequalities suggested by the conditions above. The 
triangle inequality implies that 
|r—s| anes Cee 
|r—t||t—s] ~ |[r—t| ' [t—s|° 


(2.4) 


If (2.4) is multiplied by |V(t)| and integrated over any region t, we infer, for 
any 1, s, that 

1 |V(t)| Sl tr OS, 2) I 

4n J |r—t||t—s| dts |r—s| ~ |r—s|- (2.5) 


Alternatively, if o is a region outside the S sphere defined in (2.3d), then 


1 |V(6)| : 
even = | tS 


|-s|21 |t—s|<1 


1 flv gy. 
= 4n once a 


6 |f—s|<1 
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Now, noticing that ces 7.7 has its maximum when s=r, we get 
t— Jel 
= —||V M. 2.6 
ae olf dts |V||,+ (2.6) 
Moreover 
1 dhe Sead [V(t)| V(t) 
aa J [e—t|s—tP = 4a ]r— af 1 ana iumernaea 
(27 
< LIV lor 2M) 


ae 
Additional properties of the potential are deduced in the lemmas below. In the 
following lemma, o(«, r) denotes a spherical region of radius «, centered about r 
and I(r, «), I(r’, «) are used in abbreviation of I(r, o(«,1)), I(1’, o(«,7")) re- 
spectively. 

Lemma 1. If « is sufficiently small, then for all reD, I(r, a)<1. 


Proof. Let G,o’, x be the complements in D of o(«, r), o(a, 9’) and cua’ 
respectively, and let t be the region of points contained in either o(%, 1) or 
o(a, r’) but not in both. Suppose |r —17’|< $a. As r—r’ the volume of t—0, 
so that 


Par a) — I(r’, «)| = | [(r) =r +|I(r, o) —I(r’, o’)| 
|r—r’| |V(s 


capers |r—s| 
“x 


<| I(r) — 1(r)| ds +2 g) an Ids 


(NP Be 7 Sa. 


Hence, J(r, x) is continuous in Yr for fixed «. Clearly, for each fixed r, J(r, «) 
is continuous in « and --0 as «+0. Now choose A so large that [(r)<1 if v>A. 
It follows that J(r,«)<1 for any « if y>A. But J(r,«)~+0 uniformly with 
respect to r as a—>0 1f y< A; hence the lemma is proved. 

Let {Rj} 1SiS<n be a finite sequence of positive numbers, decreasing in 
magnitude; then D may be divided into a finite number of regions Dy, D,,..., D,, 
such that : 

reD, if reD and r2ZR,, 
reD; “i, HED Wand "hi Aa 4S51S5"—1, 
VED, wlie PCD ON. ais at. 


Lemma 2. The sequences {R;\ and {D;\ described above may be chosen so that 
lV \lb,<4, and moreover R,> S. 


Proof. Choose A and « as in Lemma 1, and choose M” so large that 
[eas = 
2 s = 


s=M’ 
Then set R,= max 1SeeA, M'\. With R, determined, the remaining numbers 
R,, R3,... are to be chosen successively in such a way that 


J\V(s)|\ds<a. 
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It is seen that a finite set of R; suffices. Clearly, if 7>A, I(r, Dy) S1(r) S31 


while if r< A, ih 
D) <2 f@lass2 jee Nds<2. 


SR, s=M’ 


Furthermore, if 7=1, 
I(r, D) < tc ms) Pell as + dee assi++ {IM Jds<2. 


|jr— pie ir ee 


Thus, for all z, 
IV Ilo, = (22) Max I(r, D,) <4. 


3. Asymptotic Behavior of Solutions to the Wave Equation 
Definition. A wave function w(r) is a regular solution of (1.1) which satisfies 
the radiation condition (1.2). 
The next three lemmas prepare the way for Theorem 1, which demonstrates 
that a wave function is of order O(r7) at infinity. 
Lemma 3. If w(1r) 7s a wave function and 2, is a sphere about the origin of 
vadwus s, then as s—> oo the following estimates are valid: 


(a) [| 2S — skys) [a4 019), (3-1) 
(b) Sly(s)|d4 =0(5) (3.2) 
(c) eat (s). (3.3) 


Proof. Equation (3.1) follows directly from (1.2) by use of the Schwarz 
inequality: 


‘| fe inyldas| [aa ie iky| i4| < (4s) 0(1) =0(s). 


To prove (3.2), multiply (1.1) by y*(r), take the imaginary part of the result 
and integrate over the part of D inside the s sphere; then GREEN’s theorem yields 


x OY y =f 
(ieee ea 


aa 


where 7 is the normal into D. Expanding condition (1.2) we get 


lim {| ge] + Ble + ia (y* pea er \hdd =o. 


ae) 


Substituting for the last two terms, we get 


0 
i, [|S Ivr) e424 fp 9 Se 


3 
es 
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which implies, since k 4-0, that 


Oy |? 
floras on fe 
2s ~s 


remain bounded. Hence 


flylaa s| saa} | sivas} =00). 


cant | 


Similarly 
(2 dA =O(s). 
Os 
2's 
In the work which follows, we make use of the GREEN’S function for free space, 


ek lr—s| 


G(r, s) =— 


4n|r—s| ’ 
which satisfies the equation 


(V2+ k?) G(r, s) =d(r—s). (3.4) 


Lemma 4. Suppose R>R,, where R, is defined in Lemma 2, and let Dp be 
the region enclosed by Xp and Xp. Then for rc Dz 


v= Pore TG anh teas as: (3.5) 
R 
where ~o(r) is composed of surface integrals over Lp and Lp. Moreover 
: fa) C 
Jim pol) = [ [Gr 5) 2 vls) — vls) G(r, )| a4. (3.6) 
ZR; 
Furthermore, functions C,(s) and C,(s) can be chosen such that 
C,(s) C,(s) 
| Po (7) es [esr cengnee (3.7) 
and ay 
JC,(s)dA,=O(R), JfC,(s)dA,=O(R). (3.8) 
ZR ZR 


Proof. Multiply (3.4) by w*(v) and (4.1) by G(r, s), subtract one from the 
other, integrate over Dz and apply GREEN’s theorem. Then (3.5) emerges with 


Po (r) =u Wir, S) aA, ms, Wir, S) dA,, (3.9) 
where f = 


Wir, s) =y(s) G(r, s) —G(r, s) 2 y(s). 
The second integral in (3.9) is seen to be finite for r within Dz or on Lp » (tas 
the sum of single layer and double layer potentials) and goes to zero as +> oo} 


hence it is bounded by some constant C, independent of R, for all such that 
v=R,. To estimate the first integral, we expand W(r, S): 


Wir, 8) = G(r, 8) {kN ys) See (s)}. (3.10) 


r—s| as |r—s|? 
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A simple rearrangement of (3.10) shows that, for fixed 1, 


7) 
(Wer, s)| |e — iy] o(4) + |p(s) )| 0(<). (3.41) 
Consequently, by Saute 3, 
jim go(r) = — J W(r, 8) dA + lim igh Ve) dA 
— oo ZR, 
=— J Wir,s) dA + lim |o(t )+0(F)], 
and equation (3.6) is proved. 
Since 
Is —(r-8)/s 
we have, from (3.10), 
1 oyp(s | y(s)| 


and 


lro(r)1S [Wes 44+ s ee 20 NGAG. 4340) 


4ns|r—s| I) 
Equation (3.12) is now seen, with the help of (3.10), to be equivalent to (3.7), 
and the conditions (3.8) are guaranted by Lemma 3. 
Lemma 5. The wave function w(r) 1s bounded in D. 


Proof. The regularity of y(r) implies boundedness in any bounded region. We 
shall use (3.5) to set a bound on y(r) in Dz which does not depend on &, so that, 
letting R—> oo, we establish the boundedness of y(7r) throughout D. Let (3.5) be 
iterated, producing the series 


S g,(") + An(r), (3.13) 


n=0 


where @p is given by (3.9) and 
Pr (¥ pee Yr, S) V(s ) Pn—1(8) ds. (3.14) 


The remainder term Ay (r) is also defined recursively through 


Dr 
A f G(r, s) V(s) A,_1(s) ds 
DR 
We define 
Bee Max. in), 
reDR 
A,, = Max |A,,(7r)|; 
reDR 
then 
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Hence a 
Ayn = (3 | V |Ipp) 


N 
< (5||V lio.) 40 
S Br($||V|lo,)"** S Brey 
because, by Lemma 2 and the definition of Dp, 
IV llow S11 V Ilo, <1- (3.15) 
Therefore, we let N — oo in (3.13) and obtain 
=> att) 


Since R, was chosen larger than S defined in (2.3d), inequalities (2.5, 6, 7) are 
applicable to the region Dg. If we employ them in conjunction with (3.14) and 
(3.7), we obtain 


ln Sse f pra |VOl lool at 
where df: 
K(s) = C,() ||V log + Co(S) {|| V llog + 24}. (3.16) 


Thus, by (3.8), A(s) also satisfies 
{ K(s)dA,=O(R). 


ZR 
Invoking (3.14) again, together with (2.5), we have 
1 
| P2(r)| ee ee IVOllna®| at 
Dr 
FE 
SIP lon i dA, 


and in general, for m=1, the same eh yields 


Jen) S (IV lon)? f a. 


Then 
vis > |e.) 
Sl ool] +> (Vlog [ KOO a 6.17) 


ZR 


S| P0(") + (1 [IP lla)? f eer 4. 


[rs 


ZR 
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By (3.7), (3.8), (3.45) and (3.16) we may deduce that, as R-> oo with 7 fixed, 
the right side of (3.17) remains finite. Thus the boundedness of (7) in Dy and 
hence in D is established. 


We are now ready for the principal result of this section. 
Theorem 1. The wave function p(r) 1s of order O(r+) at infinity. 
Proof. Let Rov in (3.5), yielding 
p(T) = Yol(r) "he G(r, s) V(s) p(s) ds (3.18) 


with wo (1) = jim P(r) given by (3.6). The integral in (3.18) is absolutely con- 


vergent in virtue of (2.3a) and the boundedness of w(s). If this equation is 
iterated, we obtain, for any positive integer N, 


=> ya) )+ By (r) 
where 
= f G(r, s) V(s) y,_1(s) ds, 


B, (r) =f (r,s) V(s) B,s(s)ds, By =f G(r, s) V(s) y(s) ds. 
De Dy 
The procedure of Lemma 5, which proved jim Ay=0 shows, likewise, that 
lim Max By (r) =0 


N>o re€D, 


because y is bounded in Dy. Therefore, for r€ Dy 


= v8) 
Define 
[Iva|| = Max |r, (1)]. 


Inspection of (3.6) shows that ||w||< oo. Furthermore, by (2.5), 


lval| = Max Gf 5—a7> Ive alllM(s)1 ¢s. 


= ||Vllo, 
S (IV 


v.)" || Poll - 
Therefore, for r€ Dp, 1.e., for 72 Rj, 
r|v(r)| <> (IV lle)” lvl 


S (1—||V ll.) * Ilpoll< ee, 


and the proof is concluded. 
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4. Uniqueness Theorem 
Lemma 6. Let & denote the angle between r, 8, and suppose r>s; then 


eiklr—s| eth (r—s cos 9) = s [1 +k s] 


hp ey 2 v(vy—s) 


Proof. Set y,=|r—s 
We have, since (1 +4”)? <1+ }a?, 


Pa? —5 COS” 


v= Vr — s cos B)? + s* sin? 


s? sin? } 
aye 
< (rv — scos#) {1 a By aie art 
= 0 x 
<7 —scos aisha 
Therefore, since %.< 9, 
2 
= < Bee. 
| 2 ”l|s 2(r—s) © 
Also, 
lm—7| Ss. 
Note that 
| eth net Ci = cee a - a we 
< 2|sin $k (y, — Gs)| S kl Qi — Gol. 
Hence 
ek rm eth pe a gre i 2 i (clk ms eik 2) 
Py Y Pi ia G 
aed ft 
OY ot - R| py P2| 
< SU+Eh s) 
— r(r—s) 
as required. 
Lemma 7. // 


aoe % etk|r—s| et k[r—s cos 4] a 
=qa | pas es (4.1) 


0 


y (r) 


where w(s) 1s a wave function, then w,(r)=0(r4). 


Proof. Suppose that 7 is large enough that r>R,. Let y,(r)=A(r)+ B(r), 
where A(r), B(r) represent the integral above over the ranges Ry<s<7 and 
1's <ov, respectively. By Theorem 1, we can write |y(s)|<C/s for some con- 
stant C; then by Lemma 6, 

|A(r)| < (ttakry i Visio as 


4n¥(v—14) Ss 
R<s<r 
ao CA. 


vy (vy? —1) 
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Furthermore, 
|B (7)| 


IIA 


re oper : sf IMisilas 


1 
s2r8 


rep iter =} V(s)| ds 


1 
s2r3 


: oye f mel is) 


I 


IA 


427 
=e 


Thus, both 7A(r) and 7 B(r), and hence 7 y,(r), approach zero as 7 ov. 


Theorem 1 shows that a wave function cannot be too large for large values 
of the argument. We now derive a complementary result to the effect that the 
wave function cannot be too small at infinity. Use is made of a theorem of 
RELLICH concerning solutions ¥ (1) to the reduced wave equation (V?+ k?) y(r) =0. 
RELLICH’s theorem states that 

lim f |y|?¢A +0 
R—-oo a5 
unless ¥(#) vanishes identically. 


Theorem 2. (Analogue of RELLICH’s theorem.) If w(r) is a wave function 
for (1.1), and iim J |y[?}@A=0, then p(r) =0. 
FO yp 


Proof. We begin with (3.18) and write y(r)=y,(r)+y.(Vr), where y,(7r) is 
given by (4.1) and the remainder w,(1) is given by 


a 0 
valr) = | |G(rs) 5 ¥(s) — vs) G(r, 8)] 44 — 
ZR; : 
1 eet —ikscos? 
——— fe V(s) p(s) ds. 
Dy 


Lemma 7 and Theorem 1, together with the hypothesis of the present theorem, 
imply that 

lim f|y.|\?¢dA=lim f|p—y [dA 

R—> co Se R—-oco Sip 


slim, J lyyt+ytyl@4 
K——> OO) ZR 


<2 Jim { J vita} i) lw [eaal 
> ool sp Sy 
=0. 


' thr : 
But y,(r) satisfies (V?+h?) y,(r)=0 for r>R, since ——_ G(r, s) and its 


derivatives do so (for 7=+-0 and s€X,). Hence RELLICcH’s theorem implies 
W(r) =0, so that the revised equation for the wave function for r€D, reads 


er ( jtk|r—s| tk (r—scos #) 
Pains pee a te Vis) vis) as. 


et res y 
Dy 
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With the notation ||y|| = Max r|p(r)|<co, we get 


1 
lvls loll f 
Do 


S|lyll-2llV lo, Sallyvll 


+—||V(s)| ds 


s|r—s| 


because ||V/||p,<4. Therefore ||y||=0 and y(r)=0 for r¢Dy. Suppose that 
the vanishing of y(r) for r€D,; i<m has been proven. We shall show that p 
vanishes in D,,,,. We suppose that the origin of the coordinate system lies 
inside Y, so that yO in D. Applying GREEN’s theorem to the region D,,.1, 
when r€D,,., we obtain a relation identical to (3.18) except that Xp, and Dy 
are replaced by Xp,,,, and D,,,,. We proceed precisely as in the above paragraph, 
apply RELLIcH’s theorem to the region 7>R,,,,, and deduce in an analogous 
manner that p(r)=0 when r€D,,,,. Therefore y(r) vanishes in all D;, and 
the proof is completed. 

Theorem 3. (Dirichlet-Neuwmann uniqueness theorem.) Let y'(r) and y''(r) 
be regular in D, satisfy the Schroedinger wave equation (1.1), the radiation condition 
and the boundary conditions 


(a) yp (r) =p" (r) =a(r) (Dirichlet condition) 
or 
opr) _ oy" (r) _ a 
(b) ate ee b(r) (Neumann condition) 


on the surface X of D. Then y'(r)=y"'(r). 
Proof. Set y(r)=y’'(r)—y’(r); then y(r) is a wave function which obeys 
a zero Dirichlet or Neumann condition on 2. 


Applying GREEN’s theorem to (Vr), y*(r) in the region between » and the 
spherical surface 2'p, we have 


y 
“R “ 


Expanding (1.2), we see that (4.2) leads to 


iA=0. 


R—-0o 


; : F) 
1 2 aa NE 
a | Mallee fat | on 
2R =R 
Therefore, by the previous theorem p(r) =0, and p'(r) =p (Tr). 

Corollary. A function which satisfies the radiation condition, and which is a 
regular solution of (1.1) in the whole space, is identically zero. 


A counter-example. In the special case k=0, Theorem 2 (and hence Theorem 3) 
ceases to be true. We give a counter-example to the corollary above for this case: 
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Let 


my 4<1 
TAR eo 31 o 
0) iA 


7{5 — 3r7] cos® ae 


he 
va) = cosd PA 
Y ; 


Then y satisfies the radiation condition, is regular in the whole space and 


[V2— 


Vjy=0, and yet w=0. 
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Mildly Nonlinear Parabolic Equations with 
Application to Flow of Gases through Porous Media 


AVNER FRIEDMAN 


Communicated by J. SERRIN 


Introduction 


The study of the motion of a homogeneous gas in a porous medium, when 
the equation of state is 


(0.1) vy=y,b" (y=density, p=pressure, yy constant), 


is reduced (see [7]) to the study of the nonlinear parabolic equation 


0.2 A (L+m)/m (Atm fu 1jm ou 
( ? dle 


where A is the 3-dimensional Laplacian, k = permeability of the medium, f= 
porosity of the medium and yu = viscosity of the gas. For isothermic flows m=—1, 
and for adiabatic flows 0<m<1. In deriving (0.2), it is assumed, as is most 
commonly the case, that the external forces may be ignored. 


Equation (0.2) is a particular case of the class of nonlinear equations 


: 7) 
(0.3) Sr = Ely (x, w)] +t?) 

where p(x, 0) 20 and dy/du>0 if «>0, and where L is a linear elliptic operator 
with coefficients depending on *=(%,,...,,). As will be shown later on, (0.3) 
is equivalent to the equation 

Ov 


=~ =k(x, v) Lu+ R(x, v) f(x, #) 


(0.3') % 


where k(x, v)>0 if v>0. General mildly nonlinear equations 


Ou 
(0.4) ar = Ds Mi (x, t, u) = TES ta (x, tw) 3% + f(x, ta) 
have been discussed in the literature. The first boundary value problem has 
been solved in cylindrical domains by OLAINIK & VENTSEL [11] for n=1 and 
by VentTSEL [15] and LapyzHENSKAyYA [5] for general n. However, for >1 
they assume that the height of the cylinder is appropriately small. 
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In this paper we are interested in existence for all ¢>0. We first consider 
equation (0.4) and prove (using Schauder type estimates) that if we have an 
a priori Holder inequality (or an a priori bound on du/ét), then existence and 
uniqueness can be established for all ¢>0. Next we consider equation (0.3) 
and prove, under some restrictions on k, f and the boundary data, that there 
is an a priori bound on du/dt. 


1. Mildly nonlinear parabolic equations 
Let B be an n-dimensional bounded domain with boundary ¢B. For any 
o>0, denote by D, the topological product of B by {0<t<g} and by S, the 
topological product of dB by {0<t<o}. Set D=D,,, S=S,. By D, we denote 
the closure of D,. We recall that a function v(x, ¢) is said to be Hélder continuous 
(exponent «) in a set G if 


TAG (aye pay ne Dao ye) aa 
a (2) pe Rants [PEO ee is finite. 
We write 
cee ee aa ey 
lolfea=lole +S ay 
ne sa, a ‘at 


We say that v is of class C* in G if te <oo. We denote by “ the k-norm of 
gain D,. 
We shall consider the mildly nonlinear parabolic equation 


ts au . Ou Ou 
(1.1) = Ahi (x, £; 4) Mag fag Haslet a, ap 
in D, with the initial and boundary conditions 
(1.2) Ul&; O=h(e)-on B, u(x,t) = ¢(x,t)-on 3S. 


We shall need the following assumptions: 


(A,) L is of parabolic type; that is, for every real vector €+-0 and for every 
(x, t)ED, == CI KX Hw CS. D4; (%, t, u) &;E; > 0. % 

(A,) f and the coefficients of L have two continuous derivatives for (x, t)€D, 
CX <<a sO 

(As) The boundary 0B is of class C?**, for some 0<a<1. 

(A,) There exists a function ®(x, 4) which coincides with h(x) on B and with 

g(x, t) on S such that |®|3,,<0o for any e>0. 

Theorem 1. Let the assumptions (A,)—(A4) be satisfied. Assume that for 
every 0>O there exists a constant M, depending only on 0, L, f, ®, B such that 
whenever a solution w of (1.1), (4.2) exists in D,, it necessarily satisfies the in- 
equality 
(1.3) |u|2sM,. 


Then there exists a unique solution of (1.1), (1.2) om D. 
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Proof. It is enough to prove existence and uniqueness in D,, for every fixed yu. 
The proof is given in three steps: (i) existence in D, for small g, (ii) uniqueness 
in D, for any @, and (iii) existence in D, by repose’ application of (i), making 
use of (ii) and the a prio inequality (1. 3). Since, by assumption, wu is a priory 
bounded, we may change the definition of f and the coefficients of L for || 
large without affecting the assertion of the theorem. Hence, without loss of 
generality, we may assume in what follows that L is uniformly parabolic and 
that the second derivatives of / and of the coefficients of L are uniformly bounded 
for (x,t) ane —o<Uu<oo. 

(i) Existence in D,, e small. Denote by 2° the Banach space of functions 
v(x, t) defined in D, and having a finite norm |v|%. We define a transformation 
w= Tv as follows: Replace in / and in the coefficients of L the variable u by 
v(x, ¢) and let w by the solution of the modified system (1.1), (1.2). By [3], 
w exists and is uniquely determined. Furthermore, since the coefficients of the. 
modified equation are Hélder continuous (exponent «), we have 


(1.4) |w [8.4 Ko(N) (P[$42+|f19) =i (N) provided |o|f<.N 


where K; depend only on N, 0, L, B and K, also on f. We may however take 
K, sufficiently large such that it is independent of , 0<eSw (indeed, we can 
just take the K, of (1.4) which corresponds to g=w). 


We proceed to show that for 0 sufficiently small T has a fixed point v= Tv. 
By SCHAUDER’Ss fixed point theorem [13] it is sufficient to show that T maps 
a closed convex set Z§ of Z defined by 


(1.5) |vJg<N 


into a compact subset of itself, and that T is continuous. 


T maps ZX into itself. For any 0<t<i®°<o we have 


| w(x, t)—w (x, 0° —a/2| Ow |e 123 

(1.6) Spe S90 | 22 Ps Ky) ote 
where we have made use of (1.4). Also, 

2 w(x, t) — © w (x9, 2) é 0 

2 w(x, — 2 w(x, <K,(N) |x — x 
By integration we then obtain 

| w(x, t) w(x, 2)| | h (~) —h (x° 3 

(1.7) [x — #9] S- ape Ki (N) @. 
Combining (1.6) and (1.7), we conclude (if 01) 
(1.8) Hy (w) SH, (h) + 2.5, (N) gh. 


By the maximum principle [9] we have 


(1.9) |wB<N 
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where Ny is independent of v and can also be taken to be independent of 9, 
O0<e@Syp. Combining (1.8) and (1.9), we conclude that if we choose 


(1.10) N=2M,+N +1 (M, appears in (1.3)), 
then TJ maps Z% into itself provided 
(1.41) DIN) ob? a1 


(Remark: N= M,+ N)+1 would suffice since H,()<M,; the reason for the 
definition of N by (4.10) is that it will slightly facilitate the calculations in 
step (iti).) 

Clearly, T maps Z{ into a compact subset. 

T is continuous on ZS. Let v, v® belong to Z{. Subtracting the equation 
satisfied by w°=Tv° from that satisfied by w=Tv and setting y=w—w®, 
we obtain 


ey 2 dae ACE ae 
(e122) >» 4% t, v) x; Ox; ale ye b; (x, t, v) a a = (a0) 


where 
0? w® 
Jil = a;; (x, t, v°) — a,,(4, t, v) | —— + b, (x, t, v°) — 
D [aj 6%) — ale eS + D Bilt 


(1.13) mi 
— 5, (4,4, »)] rans [7 (x, t, v) — f(x, é, v®)]. 


Furthermore, y=0 on 6 and on S,. Using the estimate (1.4) we get 
(1.14) Hy (y) S K(N) |F |g. 
By the maximum principle, 
(1.15) |y|§ S const. | F'|§ S const. |v — v°§. 
Using the mean value theorem, after some elementary calculations we obtain 
H8(F) < const. (| v® — v|8)3. 


Substituting this inequality and (1.15) into (1.14), we conclude that T is con- 
tinuous. 

(ii) Uniqueness in D,, epSp. Let u, u° be solutions of (1.1), (1.2) in a domain 
D,, oS, and assume that u iSrOleclasss Gein D, (later we shall construct a C* 
solution). It is then easy to prove that w= w°. Indeed, subtracting the differential 
equations satisfied by u°® from that satisfied by #, we obtain for y=u—w® an 
equation of the form (1.12) with v replaced by w(x, ¢) and with F defined by 
(1.13) with v, v°, w® replaced by u°, wu, wu respectively. It is then immediately 
seen that F=E y where E is a continuous function of (x, 7?) in D,. Since y=0 
on B and on S,, we conclude, by the maximum principle, that y=u—w°=0 
in’ Ds. 
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(iii) Existence in D,. In this proof we shall make use of the assumption 
(1.3). We wish to repeat the construction of (i) step-by-step. We shall show 
that each step can be carried out in a cylinder with the same height @ as used 
in (i). The uniqueness proved in (ii) assures that a step-by-step construction 
yields a unique solution in the whole domain D,,. 

We thus suppose that a solution has been constructed in D, for some 0<o0<w, 
and we shall extend it to D,,, (assuming, as we may, that 0+ gSyp). 

Let Z’*® be the Banach space of functions v(x, ¢) which coincide in D, with 
the solution u(x, ¢) and for which |v|Z*°’<oo. We define w=Tv as follows: 
Replace in f and in the coefficients of (1.1) the variable u by v(x, ¢), and let w 
be the solution of the modified system (1.1), (1.2). By (ii), w=w in D, and, 
hence, w also belongs to Z°**. To show that T has a fixed point it is sufficient 
to show that T maps a set Z%*® (for some N>0) defined by |v|Z*°<N into a 
compact subset of itself and that T is continuous. The continuity of T is proved 
as in (i). The compactness of T(Z%*°) is immediate. It thus remains to show 
that T maps Z%*° into itself. 


We have 


(1.16) | w(x, t)—w(x, ¢)| 21 


1—a/2 = 
[7— 0/2 1(N) 0 *? provided o <<t<f<o+o. 


Also, recalling that w(x, 0) =u(x, 0), we get as in (1), 


|w(x,1)—w(x, | — |u(x,0)—u (2, 0)| 


|x—4|% = |x — x9 |% t Ky (N) @ provided oSt<o+oQ. 


(4.17) 


Using (1.3) to estimate the right side of (1.17) and combining the inequality 
thus obtained with (1.16), we get 
(1.18) HE (w) < M,+2K,(N) 0%? where G=D,.,—D 


ao 


Since, by (1.3), Hy (w) <M, we conclude 


uM? 
(1.19) Hy*?(w) S2M,+2K,(N) 0%. 


Using (1.19), (1.9), we find that if we define N (as in (i)) by (1.10), then T maps 
ZN'° into itself provided @ is restricted by (1.11). Thus @ can be chosen to be 
independent of o, and the proof is completed. 


To be able to prove existence for (1.4), (1.2) in D, one needs, via Theorem 1, 
to establish the a priori inequality (1.3). An a priori bound on |w|§ can be ob- 
tained in some cases by the use of the maximum principle. An a priori inequality 
on H;(u) is much deeper. This question would be solved if one could establish 
H6lder continuity (in the whole domain) of solutions of linear parabolic equations, 
where the Holder coefficient and exponent depend only on bounds on the coeffi- 
cients, bound on the solution, and the modulus of parabolicity of the equation. 
This has been solved, however, only for a half strip when the elliptic part of 
the equations is of the form $" (2155) (J. Nasu [8]). 


re) 
Ox; 
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In some cases however, an a priori bound can be derived on @u/@t. This 
will be shown in § 2. Hence arises the usefulness of the following theorem. 


Theorem 2. Let the assumptions (A,)—(A,) be satisfied. Assume that for any 
o>O0 there exists a constant M, depending only on 0, L, f, ®, B such that whenever 
a solution u of (1.1), (1.2) exists in D,, it necessarily satisfies the inequality 


Pens 
ee 


0° 


0 ou 
(1.20) jule+ | 


Assume finally that the matrix (a;;(x,t, u)) satisfies the K,-condition of CoRDES 
[1; p. 292]. Then there exists a unique solution of (1.1), (4.2) in D. 


Remark. The &,-condition states that the eigenvalues 4,,...,2, of (4;;) 
belong to some circular cone in the domain A,>0,..., 4,.>>0, having the origin 
for vertex. It is satisfied in the special case of a;;(x, t, uw) = k(u) a,;(x, t) provided 
k=const.>0, and (a;;) is a continuous positive matrix. 


Proof. By [/] (using (1.20)) we get an a priori bound on the Hélder continuity 
of wu and du/dx,; with respect to x. Combining it with (1.20), we obtain an a priori 
bound on | #|§. 


Remark. In some cases, when the equation is written in the form 
<a @ Ou\ , ou Ou 
ya Ox; (4; a, Dy « Ox; ot i 


the K,-condition can be removed, and instead of using [1] we can use the a 
priort estimate of DE GiorGI [2] and Nasu [8] as generalized by Morrey [6]. 


2. Generalized flow of gases through porous media 
In this chapter we shall consider a special case of (1.1), namely, 


(2.1) Se = h(x, u) [Lou t f(x )] 
where 
(2.2) Ly u = Y\a,, (x) = *— + 1d, (x) SH + 0(x) u. 


The boundary conditions are as in (2.1). We shall first prove existence, uni- 
queness and stability, and then we shall show that the equation of flow of gas 
through a porous medium reduces to a special case of (2.1). For simplicity we 
first formulate some assumptions. 


(A,) Ly is uniformly elliptic in B, its coefficients are Hélder continuous 
(exponent «) in B, and c(x) So. 


(A,) k(x, w) has two continuous derivatives for x € B,0<u<oo, and k(x, u)>0 
Ok(x, u)/Ou= 0. 

(A,) f(x, t) and Of(x, t)/8¢ are Holder continuous (exponent «) in each D,, 
o>0, and {/20, Offotso. 

(Ag) h(x)>Oand Lyh(x) +f (x, 0) S0 on B; g(x, t)>Oand g(x, t)/0tS0 on S. 
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Theorem 3. Let the assumptions (A3)—(Ag) be satisfied. Then there exists a 
unique solution of (2.1), (1.2). The solution belongs to C2 Gnveach Di o> 0 


Proof. The proof consists of two steps: (i) Deriving bounds on wu, and (ii) 
deriving bounds on 0u/0t. 


(i) Bounds on u. Consider the function 
(2.3) p(x) = exp{A R} — exp{A(m,— *)} = (A> 0, R>O). 


We take x° and R such that 0S %,—2*2<R for x=(%,..., %,) in B, and then 
choose 4 such that Lyg(x)<—y in B, where y is some positive constant. We 
compare, in D,, the solution wu with 


(2.4) p(x) = [Fo p(x) 


We have 


h(x, w) Loy — 2% < — h(x, u) | f(x,8)| in Dy. 


Hence, the function yy — u satisfies in D, 


ov 
Secs 
k(x,u) Lov oe =O 
On the boundary B+ S, y= —u=—max(|g|§, ||.) =— CG. Using the maximum 
principle [9], we conclude that y= —C, in D,, or uSyp+(C,. 
We next need to show that u is bounded (in D,) from below by a fosztive 
constant (since k(x, #) need not be defined for w=0). Since f/=0, 


jad) 


(2.5) “=k(x,u) Lou in D,. 


Ot 


Since u>0 in D,, the right side of (2.5) is elliptic. The function u(x, t)=w(v, t) 
exp{—vt} (y>0) satisfies 


Since we have already proved that w<y-+C,, it follows that k(x, w) is bounded 
from above. Hence, if y is sufficiently large (independently of w), then 


we > k(x, u) (LZ) —¢(x)] @ in D,. 
We can now apply the maximum principle and obtain 
u(x,t) =exp{—ve}C, where C,=min (g.L.b. h, g.l.b. g), 
and the proof is completed. 


(ii) Bounds on du/0t. Using (i), it follows that we may change the definition 
of k(x, u), for w large, without any loss in generality of the proof. We thus may 
assume in what follows that k and its first two derivatives are bounded uniformly 
for «€ B, —co<u<co and that k is uniformly bounded from below by a positive 
constant. We shall need a lemma essentially due to WesTpHat [16]. 


Parabolic Equations 245 


Lemma 1. Let v(x, t), w(x,t) be continuous functions in Dy having in D, 
second continuous x-derivatives and first continuous t-derivatives. Let F(x, t, 2, %;, % ;) 
(7,7=1,...,) be a continuous function in the closure of the domain E defined by 


(x ED, 2€(v(w,4),w(x,),  4€(A4 guar 


OX; z Ox; 


Zi (a t) @w(x, ay 


OX; OX; On; Ox; 
where (q,7) denotes the open interval between q and r, and assume that ( OF /02;;) 
us a continuous positive matrix in E. Assume, finally, that 


ou 
ot 
aw 
ot 


ZF (x, t, v, ee ae for (x, t) ED,, 


On = OX; Ox; 
ow Cw 


F (x, t,w, 22 ee 
bar eee On; OX; OX; 


for (x,t)€D,, 
and that 
v>w on B he 
Then v(x, t)>w(x, t) in D,. 
The lemma holds also for noncylindrical domains. 
We now denote by 2(x, ¢) the solution of the systems 


OZ _ OR(#, u) 1 0u\2 OF (x, 2) 
(2.6) Bp) ght ee (sr) + B(x, 1) SHE 
(2.7) z= Bon S, z=h(x,0)[Loh(x) +f (x, 0)] on B. 


The existence of z follows by [3]. Next we consider the differential system 
satisfied by u"(x, t)=[u(x,t+h) —u(x, t)]/h and compare it with (2.6), (2.7). 
Using the maximum principle, we then find that |w"—z|8-+>0 as h>0. Hence, 
z=0u/et. The equation (2.6) then takes the form 


ee Gp = Rew) Ege amy t Ble) as 


Note, by (2.7) and the assumption (A,), that 


(2.9) 2(%,0)S0 on B, 2(x,7)=0 on S,. 
We proceed to estimate z=0u/0t from below. Since dk/G6u=ZO0, we have 
(2.40) SS h(x, u) Lyz + h(x, u) oan) 
t ot 

We compare z with v(¢)=—C,(¢+1) where C, is a positive constant satisfying 
(2.11) Gre thax (lelsvaliic) ve Gse> lest >. 
Then 

a) Of (x,t 

oF <k(x,u)Lyv+ k(x, w) ) ; 


and v<zon B+S,. Using Lemma 1, we conclude that 


(2A2) Ap tee CO yh aD) 
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To find an upper bound on z, we compare z with v=eg(x) where g(x) is 
defined in (2.3) and e>0. By (2.8) and (A,), z satisfies 
0 ‘ 
(243) ap he u) Lgt + Ca? m D,, 
where C, is a positive constant independent of #. On the other hand, if ¢ is 
sufficiently small, then 
= > k(x, u) Lv +C,v? in D,. 


Since v>0=z on B+ S,, we can use Lemma 1 and get 


(2.14) 2(%,t)Sep(x). 


Having completed the proof that du/ét is a priori bounded, we apply an 
a priori estimate for elliptic equations due to CoRDEs [/] (see the remark follow- 
ing Theorem 2), which can be applied under our differentiability assumptions 
on L,. We obtain an a priori bound on the Hélder coefficient of w with respect 
to x, the exponent being any positive number 6<1. Combining this bound 
with the bound on du/ét, we get an a priori bound on |u|§ for any O0<P 1. 
The proof of Theorem 3 can now be completed very similarly to the proof of 
Theorem 1. 


Remark 1. The assumptions 


(2.15) Siso, SEs0, Loh(x) —f(x,0) So 


were used only in estimating @u/0¢ from above. It is easy to see that the proof 
works well also if (2.15) is replaced by 


(2.16) se, Se, Lph(x)—f(x,0)Se  (e>0) 


provided ¢ is sufficiently small. If, however, ¢ is not small, the proof fails. 
Indeed, we show this by exhibiting a function z which satisfies 


Ope 2 pk 2 A Oe ete eee ena (Ao On IV 


a(x, 0) =2(0, #) =2(1,4) => (u > 0) 


and which tends to infinity as x=}, t—>4N provided Au>8N-+2. The re- 
quired function is 


Remark 2. Let the assumptions of Theorem} hold, and, in addition, assume 
that c(x) =0, f(x, t) f(x), g(x, t) >g(x) uniformly in x€ Band xC OB respectively 
as t—> oo. Let g(x) >0 on OB. Then u(x, t)>u°(x) as to uniformly in x€ B, 
and u° satisfies the elliptic system 


(2.17) Ly Ua (a) tor re Be 
(2.18) u*(x\=¢(x)) for- AE OB. 
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Proof. The existence of u° is well known [14], [10]. Now let w=u—u®, 
and consider the system which w satisfies: 


oF = k(x, u) Low + k(x, w) [f(%) — f(x, 4)] in D, 
(2.19) w(x;0) =h(x) — g(x) on B, ; 


w(x, t) =g(x,t) —g(x) on S. 


From the part (i) of the proof of Theorem 3 we conclude, recalling that c(x«) =0, 
that wis bounded in D from above and from below by positive constants. Hence, 
the differential equation in (2.18) is uniformly parabolic in D. We then can 
apply a theorem of [4] and conclude that w(x, t) +0 as t—> o, uniformly in «€ B. 


Remark 3. Theorem 3 and the proposition in the above remark can be ex- 
tended to the case that c=c(x, t) provided dc/@t<0 and (for the proposition to 
hold) provided c—0 sufficiently fast as t+ ov. 


If the boundary values h, g are not strictly positive but only nonnegative, 
then we can approximate h, g by positive functions h,,, g,,.. The corresponding 
solutions w,, converge uniformly in each D, to a function « which may be con- 
sidered as a “‘weak solution’”’. 


Remark 4. Consider the equation (0.3). In the physical problem arising 
from flow of gases, the boundary values are likely to be positive functions. We 
then may define v=y(x, u) (w>0) in D, with @ sufficiently small, and reduce 
(0.3) to (0.3’). Using the positive lower bound for solutions found in the proof 
of Theorem 3, we conclude that a similar bound exists for w. Hence the trans- 
formation v=y(x, #) (w>0) is valid in D,,, for some e>0. Proceeding by a 
standard argument of continuation, we conclude that the equation (0.3) 1s equi- 
valent to the equation (0.3'). We can therefore apply Theorem 3 to the system 
(0.3), (1.2). Note that the assumptions k > 0, 0k/0v=0 are equivalent to 0y/du> 0, 
@y/ou?= 0 and this is indeed satisfied in the special case of (0.2). 


We conclude by mentioning that equation (0.3) has been studied by several 
authors in the special case m»=1. The most complete treatment is in a recent 
work of OLAINIK, KALASHNIKOV & YOoIE-LYNN [1/2], which contains references 
to earlier works. The methods of [12], however, are of very special nature. 


Prepared under ONR Contract Nonr-222(62) (NR 041 214) with the University 
of California in Berkeley. 
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Nouveau Fondement Fonctionnel de la Mécanique 


VICTOR VALCOVICI 


Mémoire transmis par C. TRUESDELL 


Introduction 


LAGRANGE dans sa célébre «Mécanique analytique» a transformé la Mécanique 
newtonienne dans un chapitre de |’Analyse. Il pensait que la transformation 
serait compléte sans aucun souvenir physique; il était méme trés fier d’avoir 
évité l’emploi de toute figure dans le texte?. 

La méthode de LAGRANGE était basée sur une nouvelle notion, celle du dé- 
placement virtuel, ainsi que sur deux axiomes: 


I. Axiome de la libération (,,Befreiungsprinzip‘‘ chez HAMEL) ow de l’existence 
(4). Il existe toujours un systéme de forces — «les réactions» — qui ont la pro- 
priété de libérer le systeme de points, des obligations imposées par les liaisons. On 
pourra représenter symboliquement, cet axiome par la relation suivante: 


[(F) + (R)] (4) ~[(F) + (#)] (4); 


ici (A) signifie le systeme de points considérés auquel on applique d’un cété, 
lopérateur (/’) de forces ainsi que celui des réactions (ft) et de l’autre, l’opérateur 
(Ff) des forces associé a l’opérateur (#) des liaisons, le signe ~ montrant que 
les deux opérations sont équivalentes. 


Il. Axtome du travail virtuel des réactions (U). Le travail élémentaire corre- 
spondant a un déplacement virtuel quelconque du systéme est égal a zéro. C'est 
grace a cet axiome que l’on peut s’assurer l’unicité des réactions. 

Moyennant la notion du déplacement virtuel et avec l’aide de deux axiomes 
on parvient a fournir a la Mécanique une position fonctionnelle susceptible 
d’étre traitée par les méthodes de |’Analyse. 

La Mécanique analytique ainsi concue par LAGRANGE représente une élégante 
conquéte des mathématiques. 

Néanmoins quelques fissures ont apparu au cours des années dans cette 
merveilleuse construction — des questions ayant un caractére paradoxal qui 
troublent l’harmonie logique de l’oeuvre. Je cite notamment: 


1 [1] Avertissement de la premiere édition. «On ne trouvera point de Figures dans 
cet Ouvrage. Les méthodes que j’y expose ne demandent ni constructions, ni raison- 
nements géométriques ou méchaniques, mais seulement des opérations algébriques, 
assujetties 4 une marche réguliére et uniforme. Ceux qui aiment l’Analyse, verront 
avec plaisir la Méchanique en devenir une nouvelle branche et me sauront gré d’en 
avoir étendu ainsi le domaine. » 

Il est 4 peine nécessaire de mentionner que depuis, les idées concernant la Mécani- 
que ont beaucoup changé: le support physique joue a l’heure actuelle un réle dominant. 
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a) Différence de méthode exigée dans le traitement mathématique des diverses 
catégories de liaisons (holonomes, non holonomes de premiére et de seconde 
classe) ?. 

b) Distinction entre les déplacements virtuels figurant d’un cété, dans 
l’équation fondamentale de la Mécanique (Principe de bp’ ALEMBERT-LAGRANGE) 
et de l’autre, dans les principes variationnels?. 

c) Déviations par rapport 4 l’axiome du travail des réactions*. 

Je me propose de substituer a la notion de déplacement virtuel ainsi qu’aux 
deux axiomes formulés ci-dessus, d’autres constructions qui soient exemptes des 
défauts signalés. 

I. Le Fondement Fonctionnel 
§ 1. Variation du mouvement 

Supposons que les positions des m points matériels A; (=1, 2,...,) ayant 
les masses m,; et constituant le systéme (A) soient déterminées par les vecteurs %; 
dans l’espace euclidien E; et leur mouvement par des relations de la forme 


r ‘s x= xi(l), (1) 
x,;(t) étant des fonctions dérivables du temps f, 
(classe C,), définies dans l’intervalle 


Ae t= [ip bls by <i. (2) 


Soit J’ Fig. 1, la trajectoire d’un point quelconque A 

Fig~1 du systéme, ayant la masse m, P étant sa position au 

moment f, et désignons par 6x un vecteur infinitésimal, 

fonction dérivable de ¢ dans tout l’intervalle (2) telle que sa dérivée soit également 

infinitésimale du méme ordre de grandeur. Le point P’ ayant le vecteur de 

position x’=x(t)+6x décrira la courbe J” lorsque P décrira la courbe I’; la 

courbe 1” sera voisine de J’. Imaginons un mobile fictif A’ sur J” qui se trouve 

en P au moment ¢’ (=/+ 6#), ol. nous avons désigné par d¢ une fonction dérivable 

de #, définie dans l’intervalle (2) et ayant l’ordre de grandeur de 6x, ainsi que 

sa dérivée. Le mouvement de A’ sur J” sera une variation du mouvement de 

A sur I’. Les deux mouvements considérés — celui de A sur J" et celui de A’ 

sur J” — seront synchrones seulement dans le cas 6f=0 dans tout l’intervalle 
(2); dans le cas contraire ils seront non synchrones. 


§ 2. Variation dune fonctionnelle 


Nous supposerons que les fonctions 6x et d¢ de ¢ sont arbitraires, en faisant 
toutefois la réserve qu’elles s’annulent aux deux bouts de l’intervalle (2). 


Le mouvement le long de J’ ainsi que le long de J” est réalisé a l’aide de 
lopérateur d, dx dt dx’ dt’ etc. 


x 


tandis que la variation de J" vers I” sera représentée a l’aide de Vopérateur 6, 


OX TOL euCe 


* [8] p. 507. 
SaSilp523. 
srlelpp 3272 uae 
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On prendra, comme d’habitude, 
ddt=4dot, (3) 


ce qui est juste si l’on admet que l’opérateur 6, appliqué a un point de J’ méne 
toujours a un point de J”. 
Explicitons maintenant l’expression 


d(fdt), 


ou l’on a désigné par / une fonction /(x, %, ¢) définie le long de J" dans tout l’inter- 
valle (2) et possédant les dérivées partielles des deux premiers ordres par rapport 
aux arguments x, %, t. Ayant égard a (3) on pourra écrire 


5 (f dt) =(d/-+1 4, 04) at. (4) 
Mais l’on a 
Of=f, 6% + f;,6% +f, 0t, (5) 
avec la notation 
h=Vit,  te=Vet. (6) 


6x signifie la variation de la vitesse v (=~) lorsqu’on passe du point P au point P’. 
On aura donc 
ee dx’ dx d(x-+ 6x) ax (S aed 5x) (1 a ‘3 st) dx 


GEN tdi OG4-O) Ped edt, Pat ke 


ou bien, en négligeant les quantités d’un ordre supérieur au premier, 


ire: Jd 
6% Fi Ox —% 7 Ot. (7) 


Dans le cas particulier de la variation synchrone (6¢=0) cette relation deviendra 
: d 
Oe == a On; (8) 


en notant avec 6)% la variation correspondant a la vitesse ¥(—v) dans ce cas 
particulier. Donc la formule (7) pourra étre mise sous la forme 


Se 
6% (= dv) = 09% — X= Ot. (9) 


Moyennant cette formule on pourra déterminer la variation de l’énergie cinétique 
T du systéme (A) de points, 
T=21)> m2, (10) 


la somme étant étendue aux m points du systéme. On aura 
él => mv ov, (11) 


ou bien, avec (9), 
: d 
6T = >) mv Oo% = Lime —- ot. 
Le premier terme de droite peut étre interprété comme représentant la variation 
6, de V’énergie cinétique T dans le cas des mouvements variés synchrones, 


DMD 0 OG (12) 
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De cette maniére l’expression de 67 prendra la forme 


éT =6,T — 27 © 81. (13) 
En revenant A la formule (7) nous la mettrons aussi sous la forme 
ba (= 60) = 4 (6x — 261) +H 08, (14) 
ot l’on a mis en évidence l’opérateur 
es 
= 6 — 6t— 1 
BO (15) 
appliqué au vecteur ¥%, 
Ax= 0% = % Ot. 


Cet opérateur trouvera un usage fréquent dans ce qui suit. Par suite, la relation 
(14) deviendra 
bi(=b0) = 7 Ax +H 60 (16) 


et la formule (4) avec (5) et (16), 


O(fdt) =f, dxdt+ ppd Ax + (t2% +7.) dt dt+ fd dt, 
ou bien 
=d(fg4x +f 6t) —Axdf;—dfdt+f,dxdt+ (f;% +7, dt ot. 


Faisons usage des formules évidentes 
af=(t,% + 12% + f,) dt, 


—f,4#0tdt+f, dxdt=f, Ax dt. 
On obtiendra 


(ft) = d(fzAx + $61) +(f,— 4 fs) Axed. (17) 


Dans cette formule on remarque aussi |’opérateur eulérien 


ga 4_ ta 
ax at Ox# 


appliqué a la fonction /; de cette facon on pourra mettre la formule (17) sous la 
forme 


b(fdt) =d(frAx + ft) + Of Ax at. (18) 


§ 3. Coordonnées généralisées. Espace de Lagrange 


En passant a l’espace de LAGRANGE le mouvement du point représentant le 
systéme (A) sera donné par les équations 


q = Hi), om OD 


q'(¢) étant des fonctions de classe C, dans l’intervalle (2). Une variation quel- 
conque de ce mouvement sera déterminée par les équations 


g=¢'), gQ=GO+oq7, f=t+, (19) 
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ou dq’, dt signifient des fonctions infinitésimales de f, ayant comme dérivées 
des fonctions également infinitésimales de t, de l’ordre de grandeur de 6q° et dt. 
En désignant par C la trajectoire dans l’espace de Lagrange (q) et par Q la position 
du point représentatif au moment ¢ quelconque dans l’intervalle (2) on obtiendra 
une variation de ce mouvement sur une courbe voisine C’ a l’aide des fonctions 
infinitésimales 6g’, dt. 


En utilisant les procédés du § 2 nous obtiendrons les formules 
6g = Ag+ Got, (20) 


5(fdt) =d(f; Ag+ fal) +0 f Agat (21) 


a la place des formules (16) et (48), f étant ici une fonction de gq, q, ¢ et l’opérateur 
eulérien ? ayant l’expression 


§ 4. Variation d’une intégrale curviligne 


Nous nous proposons dans ce qui suit d’étudier la genése des principes varia- 
tionnels de la Mécanique en partant d’un probleme de calcul fonctionnel. 
Considérons l’intégrale 


HD \aafilejiatat (22) 


dans l’espace réel. Le passage de /’a /”’ nous donnera 


6l=1(L') —1(l)=ffdt—ffdt, 
Te te 
ou bien, en faisant la substitution (19) dans J(J”’), 
dl = J o(fdt). (23) 
VE: 
En utilisant la formule (18) on obtiendra 


61 =f dfAxdt, (24) 
i, 


car Ax et 6¢s’annulent aux deux bouts de l’intervalle (2). 
Nous aurons également IIS fg as (25) 
Cc 

en coordonnées généralisées. La condition pour que la fonctionnelle J soit station- 

naire sera donc 
(Pj Axdt=0, (26) 

respectivement 
fofAqdt=o. (27) 


§ 5. Principe @ Hamilton-Ostrogradskt 


Le principe variationnel qui a rendu les plus grands services a la Mécanique 
est, sans aucun doute, celui D’HAMILTON-OSTROGRADSKI. Dans sa forme classique 
ce principe suppose que les mouvements variés admissibles sont synchrones. 


Via 
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Le mathématicien italien E. StorcH1® est le premier en date (1955) qui ait 
montré que dans le cas des forces conservatives on peut imaginer certains mouve- 
ments variés admissibles, non synchrones, par rapport auxquels le principe 
d’Hamilton-Ostrogradski soit valable. 

Les cas des forces quasi-conservatives. Nous avons donné ce nom aux forces 


admettant une fonction des forces U(x1,..., x,, ¢) qui dépende explicitement du 
temps ¢ aussi. On aura dans ce cas pour les forces Fj (¢=1, 2, ..., 2) appliquées 
aux n points A, du systéme (A), la formule 

F,= Uz 
signifiant au fond F=KU. 


Le travail élémentaire 0¥ des forces F, correspondant aux variations 6x du 
systéme aura l’expression 
OL. =) ON == DHInO ae (28) 
Mais l’on a 


80 = YU owet Or (29) 


de sorte que la formule (28) du travail deviendra 
0f =6U —U,6t. (30) 


Dans le cas des variations synchrones (6f=0) OU se réduit a 6)U et l’on 
pourra écrire par conséquent 


6U=6,U+U,6t. (31) 


Tachons d’établir une connexion entre la formule générale (25) obtenue 
ci-dessus et le Principe d’Hamilton-Ostrogradski en supposant que les forces 
sont quasi-conservatives. A cet effet nous ferons f=L dans (22), L étant 
la fonction de Lagrange 

L= lf G- (32) 
La formule (25) deviendra 


OL= (Ol Avda (33) 


Afin de trouver plus rapidement l’expression explicite de cette formule rem- 
placgons / dans (4) / par T et par U successivement. Nous aurons 


d(T dt) =(6T+T 5 4) dt, (34) 
ou bien, avec (13), 

6(T dt) = (dT — To, ot) at, (34’) 
ainsi que 

6(U dl) =(6U ee 4) dt, (35) 
ou bien 

6(U dt) =(d)U + U,dt+ US, ot) at (35’) 


eu égard aux formules 


OU =U ja, 0) oO eye 


Caw ealk 
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L’addition des formules (34’) et (35’) donnera 


O(L dt) =(0.L —E 4 0t+ Uo1)dt, (36) 
ot l’on a mis 
Hest ive (37) 


Par suite, la relation (33) deviendra 
= ; 
61 =f d.Lat— [ (E+ ot—Uoi)at. (38) 
La condition dJ=0 pour que la fonctionnelle J soit stationnaire s’écrit donc 
f %Lat— [ (E 4 0t—U,04)dt=0 (39) 
0 dt t 2 


Cette formule représente la forme généralisée du principe d’Hamilton-Ostro- 
gradski; car elle prendra la forme classique de ce principe 


fw Ogle d= (40) 
et partant, l’expression 
feieaé 


aura une valeur stationnaire si l’on peut disposer des variations 0x, 6¢ afin que 
la condition 


: | 
f (Eg ot — Ge) at =0 (41) 


soit remplie. Les variations non synchrones correspondantes représentent les 
mouvements variés admissibles pour le principe d’Hamilton-Ostrogradski, dont 
les mouvements variés synchrones de la forme classique en forment un cas 
particular. 

Cette importante observation a été faite pour la premiére fois par E. STORCHI®. 
Celui-ci a limité l’observation au cas particular des systémes conservatifs 
(U,=0); pourtant elle peut étre étendue aux systémes quasi-conservatifs, comme 
on a vu ci-dessus, avec la condition (41). Par ailleurs, on peut en déduire celle 
donnée par STORCHI en y faisant U,=0. 


$6. Principe de moindre action 
En prenant {= 27 dans (26) on arrive au principe de moindre action (MAu- 


PERTUIS) sous la forme 
f 2T dt = stationnaire. (42) 


Laissant de coté le facteur 2 et posant 
I=fTat (43) 


on arrive avec (34) a la condition 


[ (oT +7 G, 64) at=o. (44) 


6 [12]. 
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Donc il suffit de déterminer une fonction 6¢ satisfaisant a la relation (44) pour 
que le principe (42) soit valable. Moyennant Videntité 


d OR ae 
—W ig T ot 
oe Cie —w a 6t+To 
nous ferons une intégration partielle en (44) du dernier terme. On aura 
fOr Te d=o, (45) 


vu que df est nul aux deux bouts de l’intervalle (2). Mais si le systeme admet 
le théoréme de l’énergie sous la forme 


FS Fo 
la condition (45) deviendra 
f (6T — > Fv 6t) dt=0. (46) 


Il s’ensuit que le principe de moindre action sera valable pour tous les mouve- 
ments variés satisfaisant a la condition (46), en particulier, pour les mouvements 
variés satisfaisant a la condition 


6T = Fvdt. (47) 


$7. Autres principes variationnels 


La relation (26), respectivement (27), pourrait engendrer d’autres principes 
similaires, en dehors des deux classiques bien connus, en prenant pour / d’autres 
fonctions en dehors de L et 27; par exemple, f=U ou /=E. Dans le premier 
cas (f= U) la formule (35) donnerait 


f(ou+ U «oi dt=0, (48) 
une condition pour 0¢ afin d’obtenir le principe 
Jf Udt=stationaire. 
Dans le second (f= £) la condition correspondante pour df serait 
[ (02+8 4, 0t)dt=0, 


ce qui conduirait au principe 
J E dt =stationnaire. 


Si on prenait d’une maniére plus générale 
f=%,T+a,U, 


%, % etant deux constantes arbitraires on pourrait obtenir une infinité de prin- 
cipes variationnels’. 


7) + - L . ) L t - -) 
Dans [16] j’ai exposé aussi d’autres méthodes pour générer des principes nouveaux. 
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II. Equation Fondamentale de la Mécanique 
§ 8. Condition de liaison 


On entend d’habitude par condition de liaison une entrave de nature géo- 
métrique ou cinématique dans le mouvement du systéme (A). 

Une analyse plus détaillée du mouvement’ met en évidence l’importance du 
vecteur dx (=v dt) qui représente le déplacement réel du point considéré dans 
lintervalle dt de temps. L’élément dx est pour ainsi dire la «cellule constitutive » 
du mouvement «global» du point. Il s’ensuit qu’une «condition de liaison» 
imposée au point A devrait attaquer en premier lieu la liberté de l’élément dx; 
subséquemment celle des éléments x, ¥%, ¥, ¢. Néanmoins dans la forme classique 
de la Mécanique on met l’habitude les conditions de liaison sous la forme ® 


p(x, %, x,t) =0, (49) 
ce qui n’est pas normal. II] serait plus naturel de les représenter par des relations 
y(dx, dt) =0 (50) 


qui lient entre eux les paramétres essentiels du mouvement, dx et dt, les autres 
éléments du mouvement x*, %, ¥,f pouvant y entrer aussi. Mais la nature in- 
finitésimale de l’équation fondamentale, obtenue a l’aide du principe de d’Alem- 
bert-Lagrange impose a la relation (50) une forme linéaire 


adx+Pdt=0, (51) 


le vecteur « et le scalaire 6 étant des fonctions de x, x, ¥,¢ et adx un produit 
scalaire. Pour éviter toute erreur il faut accentuer cette propriété de « et B 
quils ne sont pas astreints a dépendre seulement de x et ¢, comme on suppose 
dans la forme classique de la Mécanique, mais des vecteurs %, ¥ aussi par leurs 
composantes 1°. Avec ce complétement on peut affirmer que les formes linéaires 
(51) comprennent les conditions les plus générales (holonomes ou non holonomes) 
qui pourraient intervenir dans la réalité!?. 


En employant des indices, on aura donc pour les m liaisons linéaires 
aidx;+ B'di=0, PEDO. es OTN ORIN (52) 
nous avons supprimé le signe >) qui devrait précéder le premier terme — ce que 
i 


nous allons faire en général, dans la suite. Comme conséquence de la forme (52) 
adoptée pour les relations de liaison, on ne sera plus obligé d’employer des mé- 
thodes différentes pour traiter les liaisons holonomes et celles non holonomes; 
elles seront traitées toutes par une méthode unitaire. 

Nous supposerons en outre que les m relations scalaires (52) sont distincte 
Elles fourniront donc au probléme analytique m relations distinctes entre les 
éléments x, x, ¥,£ du mouvement, car en y remplagant dx par % dt on aura 


o£; ap! = 0; (53) 


g 10. 


8 [16]. 

En nous limitant aux liaisons bilatérales; d’ailleurs les liaisons unilatérales 
(représentées par des égalités et des inégalités) se réduisent a celles-la. 

LOMO 
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§ 9. Equation fondamentale 
Le principe de d’Alembert-Lagrange qui coincide avec le principe que Lagrange 
appelle des «vitesse virtuelles», appliqué a la Dynamique, nous fournit /’éqguation 
fondamentale de la Mécanique 


(— m,xX,+ F) 6x,=0, (54) 
oul nous avons supprimé, comme ci-dessus, le signe va Nous écrirons cette équation 
sous la forme abrégée DidisZ os ; (55) 
avec la notation oe poe (56) 


Nous nommerons le vecteur ® «vecteur d’Alembert ». 

Maintenant surgit une question qui présente une grande importance pour la 
résolution du probléme: quelle est la répercussion que les conditions de liaison 
(51) doivent avoir sur les déplacements virtuels ? 

La forme classique de la Mécanique a donné a cette question la réponse 
connue: les déplacements virtuels 6x; devraient satisfaire aux relations que l’on 
obtiendrait de (52) en y faisant dx;=6.x,; et en y considérant ¢ comme constant, 
c.a.d en y faisant dt=0. I] s’ensuit que les déplacements virtuels 6 x; admussibles 
pour l’équation fondamentale (55) devraient satisfaire a la condition 


al Ox, =0. (57) 


La méthode des multiplicateurs de Lagrange livrera dans ce cas la solution du 
probleme sous la forme du systéme d’équations 


i 


auquel on adjoint le systeme (53). Le systeme est donc déterminé, le nombre 
des équations étant égal a celui des inconnues *;, A;. 

La justification physique du passage des relations (51) aux conditions (57) 
imposées aux déplacements virtuels 6x, admissibles est basée sur la propriété 
axiomatique des réactions R,, Reo (59) 
et celle-ci sur le «fait avéré» — reconnu comme tel — que le mouvement du 
point obligé 4 se mouvoir sur une surface immobile et parfaitement lisse (dé- 
pourvue de frottement) subit de la part de la surface une réaction normale. 

Il est évident que dans ce cas particulier la relation (59) est satisfaite si les 
déplacements virtuels admissibles 6x, sont limitées aux déplacements sur la 
surface méme. Or cette derniére condition s’exprime par une relation de la forme 
(57) et celle-ci provient de (52) en y faisant dt=0, dx,=0x,. 

La relation (59) apparait donc comme une extension naturelle du phénoméne 
correspondant a la surface immobile et dépourvue de frottement. 


§ 10. Désaccord entre l’équation fondamentale et les fondements fonctionnels 
de la Mécanique 


L’examen mathématique du fondement fonctionnel de la Mécanique conduit, 


comme on a vu ci-dessus, a la relation (24) qui contient Ax a la place ou devrait 
figurer Ox. Cette substitution 


Ax >6dx (60) 
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rend inutilisable l’élégant procédé des multiplicateurs si le probléme comporte 
des relations de liaisons ayant la forme (52) avec f/=0 (systéme rhéonome). 
Il y a donc un désaccord entre |’équation fondamentale, d’un cété, et la base 
fonctionnelle de la Mécanique, de l’autre, non seulement a cause de cette inter- 
diction de méthode; car on constate aussi que la division infinitésimale du 
probleme de la Mécanique emploie des unités différentes selon que cette division 
est réalisée moyennant le principe des déplacements virtuels ou bien moyennant 
l’ensemble des mouvements variés. 


§ 11. Variations virtuelles (possibles) 


Afin de supprimer le désaccord mentionné ci-dessus nous allons essayer de 
remplacer la notion classique de «déplacement virtuel» 6% par une autre plus 
convenable. Nous avons été conduits a cette idée aussi par la mauvaise réputation 
dont jouit la notion du déplacement virtuel; car cette notion était depuis longtemps 
reconnue comme une notion artificielle, presque inadmissible 4. Méme Gauss 
en s’y rapportant dit que cette notion ne posséde pas de crédit pour pouvoir 
étre recommandée comme plausible !”. 

D’ailleurs on avait essayé méme avant nous de la modifier. Le mathématicien 
italien Luici CAsTOLDI a proposé en 1949! une nouvelle définition pour le 
déplacement virtuel, en utilisant les «variations» bien connus de JOURDAIN. 
Mais malgré sa complication cette nouvelle notion ne parvient pas a mettre 
d’accord les deux quantités fondamentales, 4x et dx. 

Le moyen le plus simple.pour y arriver serait de faire la substitution (60), 
voila comment. 

Nous considérons 6x et 6¢ comme des variations arbitraires des vecteurs % 
et du temps ¢. Elles seront fosszbles (ou virtuelles)'* seulement si elles satisfont 


aux relations a! dx, + pidt=0 (61) 


analogues aux conditions de liaisons (52). Parmi les variations virtuelles il y 
aura évidemment, le déplacement réel du systéme — c.a.d. dx, di — ainsi que 
le déplacement virtuel de la Mécanique classique pour 6¢=0. II est clair que 
les «variations virtuelles» ont un degré de liberté en plus, si on les compare aux 
déplacements virtuels classiques satisfaisant a la condition (57), da a l’apparition 
de l’élément arbitraire 6¢ dans leur composition. 

Pour distinguer les déplacements virtuels de la Mécanique classique (d¢=0), 
des variations arbitraires 6x nous allons les désigner par 09%. 

Maintenant nous allons donner aux relations (61) une autre forme afin de 
rapprocher les «variations virtuelles», des quantités 4x, en remplagant f’ par 
l’expression pi a cag! i, 


11 [5] p. 49. 

Hes & Dee eigentiimliche Charakter des Prinzips der virtuellen Geschwindigkeiten 
besteht darin, da es eine allgemeine Formel zur Auflosung aller statischen Aufgaben, 
und so der Stellvertreter aller anderen Prinzipe ist, ohne jedoch das Creditiv dazu so 
unmittelbar aufzuweisen, da es sich, so wie es nur ausgesprochen wird, schon von 
selbst als plausibel empfohle.“ 

SBD Noy (WHA 

14 En langue russe on traduit «déplacement virtuel » par vozmojnoe pere mechnie 
c.a.d. «déplacement possible ». 
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tirée de (53). Par suite de cette substitution la relation (61) deviendra 
ol Ox; — of x, t= 0, 


ou bien, ai Ax, =0, (62) 


en utilisant l’opérateur A défini par la relation (15). 

Cette égalité nous incite A essayer la substitution de Ax partout a la place 
du déplacement virtuel 6x, comme cela se voit au passage (57)—>(62); par consé- 
quent, A essayer de remplacer la notion de «déplacement virtuel dx» par la 
notion de «variation virtuelle 4x». 

On serait arrivé de cette facon a la substitution (60) que nous avions en vue. 


L’accord entre les deux aspects de la Mécanique, — celui imposé par l’équation 
fondamentale et celui des principes variationnels —, est ainsi réalisé et par suite, 
la méthode des multiplicateurs sera également applicable dans le cas des principes 
variationnels aussi. Quant a l’équation fondamentale elle prendra la forme 


@,; Ax;=0, (63) 


au lieu de (55); associée aux relations (62) elle menera aux mémes €quations 
(58) de sorte que le systéme des équations conservera la forme du cas classique. 
La substitution (60) ne change donc rien a la solution, tout en réalisant l’accord 
entre la forme de l’équation fondamentale et celle du probleme fonctionnel. 

Il nous reste encore une chose capitale a accomplir. I] faudra justifier, au 
point de vue physique, la substitution (60) dans l’équation fondamentale, c’est- 
a-dire le passage (55)—>(63). A cet effet il sera nécessaire de reprendre les deux 
axiomes concernant les réactions. Nous maintiendrons invariable le premier 
axiome (J), de la libération. Quant au second (U) il subira une certaine trans- 
formation de contenu. 

Commengons pa la notion du «travail virtuel des réactions » qui aura maintenant 
une nouvelle définition conformément a |’élément 4x remplacant le déplacement 
virtuel 6x. Nous appellerons travail élémentaire des réactions correspondant a 
une variation virtuelle dx, dt expression R; Ax;. 

Comparée avec l’expression classique R; 6x; du travail, cette nouvelle ex- 
pression R; Ax; posséde un terme en plus dti au déplacement additionnel 6, x; 
(=—,0t) du point considéré avec sa vitesse réelle prise en sens opposé (— %;) 
et pendant l’intervalle de temps 6¢. Donc le travail sera égal 4 R;(6x;+6,%,) 
ce qui signifie que le travail virtuel dans sa nouvelle acception tient compte 
a la fois des deux sortes de paramétres: spatial (6x;) et temporel (62). 

Avec cette modification de la notion du travail virtuel le second axiome peut 
€tre maintenu dans la forme que nous lui avions donnée au commencement: 

II*. Le travail virtuel correspondant a une variation virtuelle Ax; quelconque 
du systéeme est égale a zéro. 


Nous aurons donc 


R,Ax,=0 (64) 
a la place de (59). Mais la réaction R; est égale au vecteur @,;, donc 


Ce sera la nouvelle forme de l’équation fondamentale. 
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I] faut pourtant se rappeler que la relation (59) — remplacée ici par (64) — 
6tait considérée comme possédant un puissant appui physique dans le fait que 
la réaction d’une surface immobile aura la direction de la normale, s’il n’y pas 
de frottement — phénoméne considéré comme un cas particulier du cas général 
(rhéonome). Mais ce phénoméne pourra continuer a jouer le réle de puissant 
appui physique pour la nouvelle relation (64) aussi, car il peut étre considéré 
également comme un cas particulier de la relation (64); donc, cette derniére — la 
relation (64) — pourra toujours apparaitre comme une extension — plus évoluée — 
de celle qui correspond au cas scléronome de la surface immobile. 


Conclusions 


On voit de ce qui précéde que |’élégante construction de la Mécanique ana- 
lytique reste en vigueur telle qu’elle a été imaginée par LAGRANGE, dans les 
grandes lignes de son contour. 

Néanmoins, afin d’obvier a certaines aspérités qui apparaissent au cours de 
son développement il est de rigueur d’entreprendre une révision de sa structure 
axiomatique qui est a la base de cette Mécanique devenue classique; en d’autres 
termes, les fondements méme de la discipline devraient étre de nouveau examinés. 

Dans ce but nous avons commencé par établir une base fonctionnelle trés 
large, au service des principes variationnels de la Mécanique, en leur fournissant 
ainsi un mécanisme commode et général de déduction. 

On a vu de cette maniére que les deux principes classiques — «b’ HAMILTON- 
OSTROGRADSKI» et de «moindre action (MAUPERTUIS)» — peuvent étre étendus 
a des mouvements variés admissibles constituant un ensemble bien plus puissant 
que son correspondant de la Mécanique classique. 

Nous avons montré a cette occasion que le méme mécanisme est en état 
d’engendrer une infinité d’autres principes variationnels en dehors des deux 
classiques bien connus. 

Cet examen fonctionnel nous a conduit d’une maniére naturelle a un nouvel 
élément différential, Ax, qui figure méme dans la structure organique de la 
fonctionnelle. 

L’apparition de Ax au sein de la fonctionnelle trouble profondément le manie- 
ment des principes variationnels au cas ot il existe des liaisons; car |’élément 
différential 6x employé dans le principe de d’Alembert-Lagrange comme dé- 
placement virtuel ne s’harmonise pas avec l’élément 4x de la fonctionnelle. 
Pour éviter l'état déséquilibré qui en résulte nous avons été amenés a remplacer 
la «déplacement virtuel» de la Mécanique classique par la «variation virtuelle» 
et partant, 6x par Ax. 

Ce nouvel élément, 4x, implique un élargissement de l’ancien, dx, ayant 
un degré de liberté en plus et pouvant en conséquence, satisfaire les exigences 
logiques d’une maniére plus convenable. L’opérateur A qui définit le nouvel 
élément, inclut aussi le temps, de sorte que |’élément lui-méme dépendra égale- 
ment du temps #, non seulement de x. Le temps constant qui était imposé dans 
la Mécanique classique pour arriver a la définition du déplacement virtuel n’est 
plus ici nécessaire pour la «variation virtuelle» — et celle-ci remplace le déplace- 
ment virtuel dans toutes ses fonctions. 
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Par ailleurs, la solution de l’équation fondamentale — c.a.d. celle qui re- 
présente le mouvement du systéme — reste invariable: elle reproduit exactement 
la valeur fournie par la Mécanique classique, comme d’ailleurs la valeur des 


réactions aussi. 
Mais afin de réaliser cet échange entre 6x et Ax on a di modifier aussi 


Vaxiome du travail virtuel des réactions en le mettant d’accord avec la sub- 


stitution (60). 

Ces modifications qui réparent les fissures signalées au début de ce travail 
ont été accomplies sans violer aucunement l’allure physique des phénoménes. 
L’ensemble de toutes les notions dans leur nouvelle forme nous apparait comme 
une extension naturelle des notions classiques — extension qui améne avec elle 
un completement de méthode ainsi qu’un complément de pouvoir opérateur. 
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Further Symmetry Relations 
for Transversely Isotropic Materials 


J. E. ADKINS 


Communicated by A. E. GREEN 


1. Introduction 


In earlier papers (ADKINS 1958, 1960) forms of the stress deformation relations 
have been investigated for a class of transversely isotropic bodies in which the 
stress components are assumed to be polynomial functions of the kinematic 
tensors defining the deformation and its successive time rates of variation. When 
an arbitrary number of kinematic tensors is involved, a preliminary reduction 
of the expression for the stress is achieved by using the results of classical invariant 
theory. The resulting expression is reduced to closed form by using the Cayley- 
Hamilton theorem and matrix relations derived from this by RIvVLIN (1955) and 
by SPENCER & RIVLIN (1959a, b). 

A feature of the analysis for transversely isotropic bodies is the employment 
of two alternative systems of invariants. In one of these, the kinematic tensors 
are regarded as a system of symmetric 3 x3 matrices; in the other, these tensors 
are decomposed into a system of symmetric and unsymmetric 22 matrices. 
This latter resolution enables rapid reductions of the basic invariant system to 
be achieved by using comparatively simple results which depend upon the Cayley 
Hamilton theorem for a 2x2 matrix. Relations between the two invariant systems 
are readily obtained by using the symbolic methods of classical invariant theory 
(WEITZENBOCK 1923, TURNBULL 1928). 

In the present paper the analysis is extended to include the case where the 
stress matrix is a polynomial not only in the elements of a set of kinematic 
tensors but also in the components of a number of vector fields. Attention is 
here confined to the invariance problem which is thus introduced and the possible 
physical significance of such vectors is not examined. It is feasible, however, 
that they could arise as the result of electric or magnetic fields, non-uniform 
temperature distributions or the successive time rates of change of quantities 
of this type. When attention is confined to polynomial representations a di- 
stinction may be made between materials which exhibit only rotational symmetry 
about a given direction at each point and transversely isotropic materials in 
which there is also symmetry with respect to planes containing this axis. These 
two cases are examined separately. As in the earlier work, convected coordinate 
systems are employed; this ensures that the necessary conditions are satisfied 
for invariance under rigid body motions of the entire system (see, for example, 
OLDROYD 1950). 


264 J. E. ADKINS: 


2. Mechanical properties dependent upon vectors and tensors 


We consider a homogeneous body which is at rest at time /=0 and which 
subsequently undergoes a continuously varying deformation. A system of curvi- 
linear coordinates # is associated with elements of the material and moves with 
the body as it is deformed. The covariant and contravariant metric tensors for 
this system at the initial time t=0 are y;;=y;,;(8", 0) and y'’=y""(8", 0) respec- 
tively, the corresponding quantities at the current time ¢ being [,=0,, (0, b) 
and [%/=J"1(’, t). In terms of these quantities the covariant strain tensor 7; 
and rate of strain tensors ol”? (y= 4, 253.-%) sare defined by 


ig 3 (Li; a Vi) , 
(yy) Ding 1 DL; 


(eee RES HENS) 24 
i eal 6 cae ala) ee 


where D/Dt denotes differentiation with respect to ¢ holding the convected 
coordinates # constant. In addition to these kinematic tensors, we suppose 
that the mechanical properties of the body depend also upon a number of vector 
fields v, (Y=1, 2,..., ), the covariant components of the vector v, referred to 
the system # being vu”. These vector fields may arise as the result of magneti- 
zation, electric polarization or other physical phenomena; some may be intro- 
duced as the gradients of scalar fields such as temperature while others may 
enter as the time rates of change of physical vectors. In the latter case such 
time derivatives would be formed with the operator D/D#, the process being 
analogous to that defined by (2.1). We are not here concerned with the exact 
nature of the fields which can occur or their significance; we examine only the 
invariance problem which arises from the assumption that they are present in 
the expression for the stress tensor. The contravariant components of this 
tensor, referred to the system #", are denoted by 1”. 

In considering properties of the undeformed body it is convenient to form 
contravariant, mixed and covariant components of the kinematic and mechanical 
tensors and vectors already defined by raising and lowering indices with the 
initial metric tensors yl»; ;- When the coordinates 8 coincide with a rectangular 
Cartesian system x*(=.;,) in the undeformed body at time ‘=O, vij=y" = 06;; 
and the corresponding components of any given tensor are equal. We denote 
by e,;, a\?, t'’ the components of the tensors ni;, «?, vl respectively, and by 
uv” those of the vector v, when #'=.x;. We define a homogeneous, uniformly 
aeolotropic material by the tensor relation 


ti = fille... Gers iss Oe: Daa) (ff? = pf) (2.2) 
where the functions /’ are polynomials in the arguments indicated. Any co- 
efficients occurring in these polynomials are material constants which are inde- 
pendent of position throughout the material and do not depend upon the defor- 
mation. This proviso, together with the fact that all kinematic and mechanical 
tensors occurring in (2.2) are referred to the initial Cartesian system «,, is sufficient 
to ensure that the material is uniformly aeolotropic. The use of convected co- 
ordinates ensures that the relations (2.2) are invariant under all rigid body 


Transversely Isotropic Materials 265 


motions of the entire mechanical system. This aspect has been examined by 
OLDROYD (1950), Rrvitn & ERICKSEN (1955), GREEN & RIvLIn (1957) and others. 
If # is a convected system not coincident initially with the Cartesian system ¥,, 

we obtain by a tensor transformation 
need jot 
Ox? dOxs 


ENGI (2.3) 


where 


€ vA i 
‘eukes 1 (= Ob Oo oe): (2.4) 


39 Gar das  Ox® xt 
From the components e;,, al’) of the kinematic matrices we form the symmetric 
matrices 

E=|le;;|],  A,=|la/? ll, (2.5) 
and for a given pair of values of 7, 7 the transformation matrix 

C=|C,,|| (Gees. (2.6) 


Symmetry properties of the material are investigated by examining the behaviour 
of the relations (2.2) under rotations and reflections of the x,-axes in the unde- 
formed body. Such transformations are defined by subgroups of the full orthogonal 


group 
x4 = M;; Xj M; 


vk 


M, = M,;M,;= 4; (2.7) 


44? 


and under all such transformations the matrices H, A,, C behave similarly. 
We may therefore write 
Nii = M,;, M,, Xrss (2.8) 


where y,, may represent elements of any one of the matrices (2.5) or (2.6). 
Similarly for the vectors », we have 


tk vy. (2.9) 


3. Transversely isotropic bodies 
As in the previous paper (ADKINS 1960), general forms of the stress defor- 
mation relations for transversely isotropic bodies are deduced from results in 
classical invariant theory. We consider the behaviour of the system of linear 
and quadratic forms 


FF, = A; %;, Fy = B;%;,..., = N;,%;, 


a (3.1) 
A= Ajj Xj Xj, f= by; Xi X;, wey tin = My; %4 Nj (7,7 = 1, 2, 3) 
under the group of orthogonal transformations 
X= %4, X= M, 6%; (3.2) 
where 
Mo M,,= Mz May = 9c; |M.l|=+14, (3-3) 


and here and subsequently greek indices are restricted to take the values 2, 3 
while latin indices cover the full range 1, 2,3. Under the transformations (3.2) 
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the forms (3.1) become 


where 


ay, = %1, a,=M,%,, Bio = My, Mica» 
and we have similar results for the coefficients in 
Kin aioe 


For the examination of invariance properties, it is convenient to use the 
symbolic representation of classical invariant theory for the coefficients of the 
quadratic forms. We therefore write 


Tet, moot ; 
Qj = jj, = YA = 4 A = 4; Die TS (3.7) 

. e . ‘ iad 
with corresponding expressions for b;;,..., m;;. The symbols a;, a;, 4; ,..., Mj, 
m,,m;,... and the coefficients A;, B;,..., N; behave similarly under corre- 


sponding transformations of the orthogonal group. In particular, for the subgroup 
defined by (3.2), (3.3) we have 
Ga tee PU Oe (3.8) 


corresponding to the first of (3.6). Furthermore, any function of the coefficients 
a;;,0;;, +++, m;; May be represented in terms of the corresponding symbols and 
this symbolic representation gives a unique determination provided different 
symbols of a given set are used to represent a given coefficient each time it 
occurs. For example 


r , , ed 
Aji Ag7 = A,A;, AZ A;, = Az Ajj =A, 4,4; a}. 


To the system of forms (3.4), (3.5) we adjoin the linear form 
L=1,%,=%, (3.9) 
where 
(21, 22, 43) = (4, 0, 0). (3.10) 


It then follows from results given by WEITZENBOCK (1923) and others that any 
polynomial in the quantities A;, B;,..., Nj, a;;,0;,,..., m;; Which is invariant 
under all transformations of the group (3.2) can be represented symbolically 
as a sum of products either of the quantities 


&1 b: & 
(Sno) =)m me ms}, (E72), — (3) 
Ci fe Cs 
wit 
(Fy) =n, (61) =§1,=&, (ii) oie 
or of the expressions 
(Ent), (&ly)=E.n, (EI) =. (3.12) 


In (3.41), (3.12) &;, 4;,¢; may represent any of the symbols a,, a;;, aj, ..., ;, mi, 
m;,... or the coefficients A;, B;,..., N;. Relations between invariants formed 
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from the systems er (3.12) may be obtained by writing 


= (€|7) + (€2) (04), 
os = (Ent) (00) — (E60) (mq) + (gd) (Z€). 


If we consider polynomials in the coefficients of (3.1) which are invariant 
not only under transformations of the group (3.2) but also under the reflection 


(3.13) 


(%1, %2, X3) = (%, Xz, — %g), (3.14) 


the determinants (€7¢), (7 /) must be omitted from the symbolic representations. 
In place of (3.11), (3.12) we then have 


(€7)=&n,, (1é6)=&, (3.15) 
(€[n)=€.., (26) =&, (3.16) 


respectively. 

From the definitions of § 2 it follows that the coefficients a;; and the elements 
of the matrices C, E, A, behave similarly under all transformation of the ortho- 
gonal group; the same is true of the coefficients A; and the vector components vw”, 
Furthermore, the transformations (3.2) and (3.14) may be used to describe a 
rotation of the x,;-axes of §2 about the x,-axis and a reflection in the %,, %- 
plane respectively. It follows that if from the symbolic quantities (3.11) or (3.12) 
we construct invariants of the coefficients in (3.1), and in the resulting expressions 
replace a;;, b;;,...,m;; by elements of C, E, A, and A;, B;,..., N; by elements 
of v, we obtain a system of invariants and coefficients appropriate to materials 
possessing the rotational symmetry described by (3.2). A similar procedure based 
on (3.15) or (3.16) yields the corresponding system for bodies transversely iso- 
tropic relative to the x-axis, these being characterised by the transformations 
(3.2), (3.14). 

Since the product of two symbolic determinants (€ 7 ¢) (&''¢’) can be ex- 
pressed as a polynomial in the scalar products (ii) of (3.11) it follows that any 
polynomial invariant of the rotational symmetry group formed from the quan- 
tities (3.11) can be represented symbolically as 


(Ent) ht (én!) ik+TJs, 


where J,, Jz, J3 are polynomials in the scalar products (£7), (€/). Similarly for 
any polynomial invariant formed from the quantities (3.12) we may obtain a 
symbolic representation 


(Eni) fi +he: 


where f;', fk are polynomials in the products (&|7), (€/) alone. Evidently, if the 
polynomial basis is abandoned, the factors (7 ¢), (7!) may be omitted and 
the invariants for the rotation group may be constructed from the quantities 
(3.15) or (3.16). 
4. Some relations for matrices 

For subsequent reductions we require the following results in the theory of 
matrix algebra which are a consequence of the Cayley-Hamilton theorem and 
are based on analysis given by RIVLIN (1955): 
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(i) Any matrix polynomial P in 2x2 matrices B, (k==1,°20017 yn)rean" be 
expressed in the form 


P= 1+ 2,9; Bid 2 074 By By, (4.1) 
7=1 Hid iS 


where @, 9; and ;,, are scalars expressible as polynomials in the elements of 
B, and I is the unit matrix. If P is homogeneous and of degree m in the elements 
of the matrices B; then each term of the right hand member is also homogeneous 
and of degree m in these elements. Furthermore, from the method of proof 
given by RIvrin, it follows that if P is a matrix product of the form BY By. me By 
where «;, %;...%, are positive integers, then %, 9;, Pj, are all polynomials in 
traces of products and powers of the matrices B,. 


(ii) Any invariant J of the form tr HB, Ee formed from the system of 
2x2 matrices B; is expressible as a polynomial in 


trB;, trB;B;,  trB;B;B,. 


This follows immediately from (i) by forming tr B; P and writing P=B,... B,. 
This gives 

I= tr B;+), 9,1 B;B,+ >, 2d 9j,tt B; B; B,, 
j=1 7 


j=1 k=1 


and if I is of degree m(>3) in the elements of B,, B,,..., B, the invariant 
functions @, ~;, Pj, are of degrees less than m. Repetition of this procedure 
gives the result stated. 


(iii) Any polynomial invariant formed from the elements of a system of 2 x2 
symmetric matrices B, (k=1, 2,...,) can be expressed as a polynomial in 


LE B,, tr B; B,. 


(iv) Any polynomial invariant formed from the elements of a system of 2 <2 
symmetric matrices B, which involves also the elements of an unsymmetric 
matrix B linearly can be expressed as a polynomial in 


tr.B,, ir BSB, tr B,. tt i. , ¢ thie 


This result follows as a corollary from (ii) and (iii). 
(v) If A and B are two 3 x3 matrices then 
ABA=— A?B— BA? + P(2), (4.2) 
where P(2) is a matrix polynomial of degree two in A, B, the coefficients in this 


polynomial being scalar polynomial invariants formed from the elements of A 
and B. 


The results (i) to (iv) depend upon the Cayley-Hamilton theorem 
A?=AtrA+4I[tr A? — (tr A)?] (4.3) 


for a 22 matrix A; (v) depends upon the corresponding result for a 3 x3 matrix. 
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5. Reduction of invariants for transversely isotropic bodies 


From the symbolic products (3.15) we may construct invariants of the following 
types: ; uM 
Oe, Dig Lim Lands (1) 415054 +++ Crmtmy> (ii) 
Aj 055 Ojy.. Cmtnr: (111) Ai, jess Gen bins ea (iv) 


Type (i) arises from products of the factors (€7) alone in which each symbol 
occurs twice; these symbols therefore combine, as in (3.7), to yield coefficients 
a;;,0;;,...,m;; of the quadratic forms in (3.1). Type (iv) also arises from the 
products (7) but two of the symbols in this product occur only once giving 
two coefficients of the linear forms in (3.1). In the symbolic forms representing 


(5.1) 


(ii) there are two factors (&/) containing /, and each symbol , 7, ... occurs twice; 
in (ili) there is a single factor (€ 7) and one of the symbols &, 7, ... is not repeated. 
The coefficients A,, B,,..., Ny, 41, 0,1,---, M1 are also invariant. It is readily 


seen that products containing more than two factors (€/) can be written as 
products of simpler invariants of the types (5.1); a similar remark applies to 
products in which more than two of the symbols £, 7, ... occur only once. From 
the symbolic products (3.16) we may construct the invariants 


Veep Opy sasha tre (i) Ay aps Coatar, (11) 


¥) (5.2) 
Ag Gy 6 Og ++<Cog tay, ~ (ill) Ay Gp Opy ++ Catan Ey, (iv) 


analogous to (5.1). 


The invariant system (5.1) may be reduced further by making use of the 

Cayley-Hamilton theorem for a 3 x3 matrix and the generalizations derived by 
RIVLIN (1955) and by SPENCER & RIVLIN (1959a, b, 1960). Reductions of the 
system (5.2) are based on the Cayley-Hamilton theorem for a 2x2 matrix. 
Expressions for the invariants (5.1) in terms of (5.2) are obtained by converting 
each of the coefficients into symbolic notation and expanding the expressions 
obtained with the help of (3.13); the converse procedure yields expressions for 
the invariants (5.2) in terms of (5.1). Relations of this kind have been derived 
in an earlier paper (ADKINS 1960) and a general method of reduction of the 
invariants (i) and (ii) of (5.1) and (5.2) has there been indicated. We are here 
concerned mainly with the additional invariants arising from types (iii) and (iv) 
and consider first the forms (5.2). 
We allow a;;, 0;;,..., 4;, B;,... to denote typical coefficients of the forms 
(3.1), it being understood that different letters denote coefficients of different 
forms, no two forms being identical. From these coefficients we construct a 
system of symmetric 2 <2 matrices 


b= ||aael|, = l]%10 %:l| (1 =1,2,...,m), (5.3) 
a;=||A, Ag] @= 4,2... 
and a system of unsymmetric 2 x 2 matrices 
b;, = ||410 Bll Cie DEM. hea b), 
Ci, = || 40 41|| (Deh Der Lae ees AOL, 7)! (5.4) 
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We thus obtain m, m and n matrices of types b,, ¢;, d; respectively and 3m (m— 1), 
mn and 4n(n—1) of types B,,, ¢;, and d;, respectively. It is sufficient for the 
subsequent analysis to omit the matrices ||, ||, ||B.Ag|| which are the 
transposes respectively of b;,, @;, in (5.4). 

From (5.2) we observe that any polynomial invariant formed from the co- 
efficients of (3.1) is expressible as a polynomial in the traces of products of the 
matrices (5.3), (5.4) which either involve the set b; alone or at most one of the 
matrices ¢,, d;, b;,, €;,,4;, linearly. It follows at once from the results (ii) 
and (iv) of § 4 that each of the invariants (5.2) may be expressed as a polynomial 
in* 

41> Dea a» yp Ope.» Ay 4, Oy1; (i) 

Ay Ong opi» ii) 
Aya xp OBy Cyr (@ 20, ae) j 
Ay Ong py Ay1 (a,b,c,d all different),  ( 


( 
(111) 
A, Ag Gq1, Ag 4p 581, (v) 
( 
( 


iv) 
Ay yp Ogy Cy1 (i= Dee Ch, vi) 
AB, A, Bg ays; 

A, Bg apy Oya (A= Dae mh) . 


(vil) 
Vill) 
The restriction a=£0 in (iil), b=:c in (iv) and a= in (vi) and (viii) follow as 
an immediate consequence of the Cayley-Hamilton theorem (4.3). To obtain the 
remaining restrictions we need to use (4.2). For example, if we express the 
invariant (vi) in terms of those of the system (5.1) by converting into symbolic 
notation and using (3.13) we obtain 
Ag yp bpy Cy = (A| 4) (a| 6) (b| c) (c2) 
= [(Aa) — (AJ) (al)] [(ab) — (a2) (62)) [(b¢) — (62) (cZ)} (c2) 
= A; 4;; Dik Cha — Aya; b;; Cj, — Ajay; Cir (5.6) 
== Ay aj Diy Cyy + Ay ayy O45 Cy + 
+ Ay 5 Diy Oy + AG Gy Oy Oy — Ay yy Oy 11 
If a;;=c;; the first term in the expansion (5.6) may be reduced by means of 
(4.2) giving 
Aj 45544 O41 = — Aj O55 jy On, — A505; GE Mer + VY, (5.7) 


Veet) 


where ¥ is a polynomial formed from invariants of the second degree or lower 
in 4,;,6;;, these invariants being irreducible. With the help of the symbolic 


* Since this paper was prepared, a paper has appeared in which A. C. Pipkin 
R. S. Rrviin [Arch, Rational Mech. Anal. 4, 129 (1959)] give the invariant 
system for a single matrix and a system of vectors. They treat the problem from 
a slightly different view point from that of the present paper, and their invariant 
system may be regarded as a special case of that given above. 
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method, the first term on the right-hand side of (5.7) may be written 
A; 5; 4; p Opy = Ay Gq p Ap d,4 + P= 2", (5.8) 


where Y’, Y”’ are polynomials in invariants of the second degree or less in the 
elements a;;,0,;, the final result following as a result of the Cayley-Hamilton 
theorem (4.3). Application of the symbolic process to the term A; b;; Aj, Ap, in 
(5.7) yields an expression for this quantity in terms of the invariant system (5.5) 
thereby completing the required reduction. A corresponding procedure applied 
to the invariants (iv) of (5.5) leads to the restrictions indicated upon the letters 
a, b, c, d (ADKINS 1960). 


From the system (5.5) we may, by using symbolic methods, construct an 
equivalent system of invariants of the types (5.1) in which the repeated suffices 
cover the full range 1, 2,3. We thus derive the alternative scheme 


Vig, UG Uje, = Mig Mei» (i) 

Gi Oss, Ag Ain Onir — Gig An Ons dys (4 +5), (ii) 

Wy, Ni, Hib, Uae, (49), (iii) 

Wi Din Cer, 105; Cin Cer, UG%8;C;n4,1 (4, 6,¢,d all different), (iv) 

Ae Aig Gin: ot A dag i ins (v) oe. 
A, 4;; 0,4 Cay (a= 0, Baal). (vi) 
A,B;, A; B;a;;, (vii) 

A; By aj 4 Oy; (A ee Ba == 0), (vii) 


in which (v) to (viii) follow directly from the corresponding invariants of (5.5) 
and (i) to (iv) have been given in the paper previously cited. 


In applying these results to restrict the form of the stress deformation relations, 
we first observe that under transformations of the x,-axes, each of the stress 
components t? may be regarded as a scalar invariant formed from the matrices 
C, E(=A,), A, (7=1, 2, ..., 2) and the vectors v, (s=1, 2,..., m) which is form 
invariant under the transformations (3.2), (3.14) and involves the elements of C 
linearly. We may therefore write 


M 
t=) Py ,, (5.10) 
Pia PO(CE, A, ©,)'; P, = 9, (E, Ap, Y) , (5.14) 


where Py , Q, are polynomial invariant functions of the arguments indicated and 
PY are lee and homogeneous in the elements of C (i.e. Ci). It follows that 
the functions gy, are polynomials in the invariants formed from (5.9) by replacing 
the coefficients a;;, b;;, ¢;;, d;; in every permissible manner by the corresponding 
elements of E, A, and the coefficients A;, B; similarly by elements of the 


vectors ,. 
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The polynomials g, thus become functions of the systems of invariants 


trA;, tr Al ic. AS (i) 

tA, Al tr AA tra A}, tr Aj Aj, (ii) 

[Aihi, [Aihu, (iii) 

[A; Ajit, [A? Aj|ii , [A; Alli (t =+-7), (iv) 
[A;A;Ayhi, [AjA;4ghi, [4i 4, Ahi, (5.12) * 


[A;A,4?),,,  [4;4,4?]1 (4,7, all different), (v) 
[A; A; A, Aj), Ly [A; A; A, A,)|; 1> [A; A), A; Aj), 1> 
[A; A, A, Ajhi1, [A; A, A; Aj|i1; [A; A; A, Aj)ii 


(7,7,k,/ all different), (vi) 


[eh , [v, Aj); , [v, A?) , ; (i) 
[v, A; Aj\, , [v, A; A*), , [v, A; A, A,|; ) (i1) 
irtre ire, Ag (iii) (5.13) 


17 U,.,) tO, OA; a te, Opa, 
(ry +s; 1,7,k all different), (iv) 


obtained by taking all possible choices of one, two, three or four as appropriate, 
of the letters 7,7,,/ from the numbers 0 to m (4)=E) and of 7, s from the 
numbers 1 to m, together with the invariants derived from (5.13) by all possible 
permutations of 7,7, 2. 

In (5.12) the symbol [ ],,; denotes the leading term of a matrix product 
corresponding to (ili) and (iv) of (5.9), in (5.13) [ ], indicates a product of types 
(v) and (vi) containing a single vector and with one suffix taking the value 
unity, while the last five terms of (5.13) are scalar products containing a pair 
of vectors. The meaning of the symbol tr has been extended to cover these terms, 
which correspond to (vii) and (viii) of (5.9), where summation is carried out 
over all suffixes. 

The coefficients Pi are derived by forming from (5.9) the invariants in C, 
E, A,,v, which involve C linearly. This yields the system 


Doe wrCA, trOdr (i) 

(Chi, ([CAihi, [C Ata, (ii) 
[CA:Ajhi, [CA;Ajii, [CA;Afhi, [CA;AThi  (@-F7), (iti) (5.14) 
[CA;4;A,]1,, [CA;A, Aji, [CA;A;Aghis, 

[CA;A,Aj]11, [CA,A;A,],,, [A,;C A;A,|1, (%,7, all different), (iv) 


* For the derivation of the restrictions on the invariants (v) and (vi) reference 
may be made to the earlier paper. 
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[v, Ch, ? [w, Cc Aj), ? [v, A; Ci, , [v, Cc A?), 5) (i) 
[w, CA; Ajh,, [w, CA; Aj), [v, A; CA), (5 15) 
[v,A;CA,],, [v,A;4;C],  (¢4), (ae * 


aCe i Ce. etry, t. CA; trv, 0, A.C (Fa 'S) eel aia) 


The coefficients (i) in (5.14) and (iii) in (5.15) are those appropriate to isotropic 
bodies and may be written in more familiar forms in terms of components referred 
to the convected system #. Remembering (2.4) and the definitions of § 2 we 
obtain 

trC=y", trCA,=0%,  trO Aaa} gis 

trev=v' yo! tre v,U,=F [u”? yi)7 4 alF ys) 4) 


trv, v, CA, = 3 [vo Pf 4 yl)! yi g 47, 


6. Polynomial invariants for rotational symmetry 


It is of interest to examine the forms of the additional invariants which would 
be required for a polynomzal representation of materials possessing only rotational 
symmetry. The symbolic representations of these are linear in (7 ¢) or (& 71) 
if formed from the elements (3.11) or linear in (€ 7 /) if formed from (3.12). We 
investigate the consequences of this latter representation. 

The additional invariants for rotational symmetry are readily constructed by 
noticing that 


(El) = 55285; Le = €up1 ba Np = Eup Sa Np> (6.1) 


where €;;,=+1 or —1 according as 7,7, k is an even or odd permutation of 
1, 2, 3 and is zero otherwise and ¢,,=€,8,. Symbolically, (6.1) may be written 


(52) = (e|&) (elm), 


with ¢,ég—=€,,- The additional invariants may therefore be expressed in terms 
of symbolic products of the types (3.16). As before, we obtain the forms (5.2), 
but in each invariant one of the elements a,,, bys, ... is replaced by é,,. Re- 
ductions of these invariants again follow from the results of § 4. In this way we 
derive the systems 


Exp 4py Oy (a + )), i) 


(i 
Ay x Exp f1 (a = 0), Ay x Exp Opy Cy1> (i1) (6.2) 
(i 
( 


Ar x Exp Ag, Fya Fa p Opy Ay, Ayn Ona py A,» iil) 


Agéxp Bp (A+B), Ag Exp My B,, iv) 


* The coefficient [v, 4; A; C], can be expressed in terms of the remaining coeffi- 
cients of (5.15) and is therefore excluded. This follows from the results obtained by 
RiIviin (1955) that the linear combination 


is expressible as a polynomial in matrix products of the second and lower degree. 


For the same reason, the six products [v, A; A; Aj] ,of (5.13) obtained by permuting 
i,j, k are also linearly related. 


corresponding to types (i), (ii), (iii) and (iv) respectively of (5.2). Any invari 
of higher degree than those listed in the coefficients a;;, bij, Cz, ++. May be 
reduced by means of (4.1). For example, the invariant 4 
ma 
Ayo Exp Opy Oye Ao 


of type (ii) may be written as 

ave Exp (Opy Cys Bogie ¥ 
where 

Be Now =4g1 Ua; 


and the reduction follows immediately from (4.1). 
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